Abstract. We prove that a class of A-stable symplectic Runge-Kutta time
semidiscretizations (including the Gauss-Legendre methods) applied to a class
of semilinear Hamiltonian PDEs which are well-posed on spaces of analytic
functions with analytic initial data conserve a modified energy up to an
exponentially small error. This modified energy is $O(h^p)$-close to the original
energy, where $p$ is the order of the method and $h$ the time step size. Examples
of such systems are the semilinear wave equation or the nonlinear Schrödinger
equation with analytic nonlinearity and periodic boundary conditions. Standard
backward error analysis does not apply here because of the occurrence
of unbounded operators in the construction of the modified vector field. This
loss of regularity in the construction can be taken care of by projecting the
PDE to a subspace where the operators occurring in the evolution equation
are bounded and by coupling the number of excited modes as well as the number
of terms in the expansion of the vector field with the step size. This way
we obtain exponential estimates of the form $O(\exp(-\beta/h^{1/(1+q)}))$ with $\beta > 0$
and $q > 0$; for the semilinear wave equation, $q = 1$, and for the nonlinear
Schrödinger equation, $q = 2$.



Contents 

1. Introduction 

2. Backward error analysis for Hamiltonian ODEs 

2.1. Implicit Runge-Kutta methods for ODEs 

2.2. Embedding of the method into a flow 

2.3. Modified Hamiltonian 

3. Semilinear Hamiltonian PDEs 

3.1. Semilinear evolution equations 

3.2. Differentiability of the semiflow 

3.3. Hamiltonian structures on Hilbert spaces 

3.4. Spaces of analytic functions 

3.5. Functional setting for the semilinear wave equation 

3.6. Functional setting for the nonlinear Schrödinger equation

3.7. Nonlocal Schrödinger equation on a star-shaped domain

4. A-stable Runge-Kutta methods on Hilbert spaces 

5. Backward error analysis for time-semidiscretizations 

5.1. Statement of the main result 

5.2. Galerkin truncation 

5.3. Embedding of semidiscretizations into a flow 






Date: August 23, 2012. 






C. WULFF AND M. OLIVER 



5.4. Backward error analysis on Hilbert spaces 

Acknowledgments 

References 






1. Introduction 



When discretizing conservative dynamical systems, it is often desirable to pre- 
serve their invariants or proper discrete analogs of such invariants. It is usually not 
possible to find a method that preserves all invariants, but there are often methods 
which preserve invariants approximately over very long times. 

Symplectic integrators are a class of such methods that has been extensively
used and studied in the context of Hamiltonian ordinary differential equations
(ODEs). They are designed to preserve the symplectic form of the Hamiltonian
system exactly, and it can be shown that they also approximately preserve energy
over exponentially long times, see e.g. [HI [15] [HJ [33] and Section 2 below. The
main technical tool used in the proof of such results is backward error analysis.
Backward error analysis seeks a modified Hamiltonian vector field whose flow at time $h$
(the step size of the numerical method) approximates the one-step integrator $\psi^h$
with step size $h$ to very high accuracy and which, on the other hand, exactly preserves
a modified Hamiltonian $\tilde H$ which, for a method of order $p$, is $O(h^p)$-close to the
Hamiltonian $H$ of the Hamiltonian system to be discretized.

Results of this type go back to Neishtadt [25], who showed that an analytic
diffeomorphism which is $O(h)$-close to the identity can be embedded into a flow up
to an exponentially small error $O(e^{-h_0/h})$ by taking an optimal choice of
$N = O(1/h)$ averaging steps of a normal form transformation. Benettin and Giorgilli [1]
give an alternative construction, more suited to the application to numerical
integrators, by matching a Taylor expansion of a symplectic diffeomorphism $\psi^h$
with the formal power series expansion of the flow of an arbitrary $h$-dependent
Hamiltonian vector field, truncating at some order $N$, carefully estimating the
remainders, and concluding that the truncation is optimal when $N = O(1/h)$. This
method has been used to prove approximate energy conservation of many classes
of symplectic numerical methods [14] [15] [19] HI]. In particular, symplectic
Runge-Kutta discretizations $\psi^h(\cdot)$ of analytic Hamiltonian ODEs approximately conserve a
modified energy $\tilde H(x)$ with exponentially small error $O(e^{-h_0/2h})$ over exponentially
long times $O(e^{h_0/2h})$ provided that the discrete trajectory remains in a bounded
set; see Section 2 and in particular Theorem 2.3 for a more detailed review of those
results.

In the case of partial differential evolution equations (PDEs), the phase space
is typically an infinite-dimensional Hilbert space and the vector field contains
unbounded operators, usually in the form of spatial derivatives. This poses serious
difficulties for backward error analysis. The flow map of the PDE usually ceases
to be analytic, or even differentiable, with respect to $h$, and typically the same
applies to the numerical time-$h$ map $\psi^h$ used to semidiscretize the PDE in time; the
best one can generally guarantee is continuity in $h$. In other words, the formal
power series expansion of $\psi^h$ in $h$ now contains powers of unbounded operators,
and this propagates into the formal series expansion of the modified vector field.
Note that the analytical difficulties persist when analyzing full space-time
discretizations of the problem. In this case, the corresponding unbounded operators
become matrices with norms diverging to infinity as the spatial resolution increases.
Consequently, the constant $h_0$ in the exponential error estimate $O(e^{-h_0/h})$ tends
to $0$ with increasing spatial resolution, so that backward error analysis for general
initial data without the requirement of high regularity leads to severe restrictions
on the time step size. In the case of hyperbolic problems such as the semilinear wave
equation, the standard approach fails for step size ratios close to the CFL limit,
i.e., in the practically relevant regime.

Nonetheless, we can show that backward error analysis of this type can be used 
for PDEs provided the solution to the full PDE has high regularity. Time semidis- 
cretizations of semilinear Hamiltonian PDEs are typically only well-defined if the 
method is fully implicit; explicit or partially implicit methods such as partitioned 
Runge-Kutta methods, the simplest of which are the leap frog and symplectic Euler 
schemes, violate the CFL condition [17]. Thus, in this paper we consider a class 
of (implicit) symplectic A-stable Runge-Kutta methods which includes the Gauss- 
Legendre Runge-Kutta methods. The simplest of these methods is the implicit 
midpoint rule. 

Our approach applies to a large class of semilinear Hamiltonian PDEs with
analytic nonlinearities, including the semilinear wave equation and the nonlinear
Schrödinger equation on the circle. Our main result, Theorem 5.1, can be
paraphrased as follows. If a semilinear Hamiltonian evolution equation with energy
$H(\cdot)$ is discretized by a symplectic A-stable Runge-Kutta method $\Psi^h$ of order $p$,
then there exists a modified energy $\tilde H(\cdot)$ which is defined for Gevrey regular data,
is $O(h^p)$-close to $H(\cdot)$, and is conserved by the symplectic integrator $\Psi^h$ with
exponentially small error $O(\exp(-\beta/h^{1/(q+1)}))$. Here, $q > 0$ is determined by the PDE,
while $\beta$ depends on the PDE, the regularity of the initial data, and the integrator.
This result implies that if the solution to the continuum problem remains Gevrey
regular over a finite interval of time, then, by consistency of the scheme, the
discretization in time preserves a modified energy with an exponentially small error on
the same time interval (Corollary 5.5). Moreover, provided the numerical trajectory
$U^n = (\Psi^h)^n(U^0)$ remains Gevrey regular, the numerical energy is approximately
conserved over exponentially long times (Remark 5.2).

Let us mention some related results. Moore and Reich [24] made a first step
towards a formal backward error analysis for multisymplectic discretizations of
semilinear wave equations and their corresponding energy and momentum
conservation laws. They derive a modified higher order multisymplectic partial differential
equation which is satisfied by the numerical solution with higher accuracy than the
discretization error. See also the related work of Islas and Schober [18]. In [26], it is
shown that second order finite difference space-semidiscretizations of analytic
solutions of the semilinear wave equation approximately conserve a discrete momentum
map up to an exponentially small error. Cano [3] considered symplectic space-time
discretizations of semilinear wave equations and constructed a finite order modified
Hamiltonian, assuming certain conjectures on the smoothness of the fully discrete
system.

There has been a lot of recent progress on the approximate conservation of
invariants by geometric integrators for Hamiltonian PDEs based on normal form
transformations or modulated Fourier expansions. These results are strong in the
sense that they provide a detailed analysis which does not depend on extraneous
regularity assumptions; however, the techniques of proof are limited to small
initial data, i.e., they apply in a weakly nonlinear regime, or they require a
modified splitting method which dampens highly oscillatory motion.

Dujardin and Faou [6] prove approximate energy conservation of certain splitting
methods applied to linear Schrödinger equations. Due to the linearity of the PDE,
regularity of the numerical solution, if satisfied at the initial time, is guaranteed
for all times. As a consequence, the numerical method $\psi^h$ conserves a modified
Hamiltonian $\tilde H$ over exponentially long times. Debussche and Faou [5] prove that
splitting methods applied to linear Schrödinger equations, where the Laplacian part
is discretized by the implicit midpoint rule, conserve a modified energy exactly.
Note that, in contrast, (symplectic) Gauss-Legendre Runge-Kutta discretizations
of linear Hamiltonian systems preserve energy exactly [15].

In the nonlinear case, Faou, Grébert, and Paturel [8] prove conservation of
actions of numerical trajectories of Hamiltonian evolution equations associated with
splitting methods for small initial data over polynomially long times under generic
non-resonance conditions for full space-time discretizations by using normal form
theory under a condition coupling spatial and time step size that excludes
time-semidiscretizations. For "rounded" splitting methods which involve a cut-off in the
high frequencies, they prove a similar result for time-semidiscretizations which
implies long-time existence of regular numerical trajectories [9]. Faou and Grébert
[7] show approximate energy conservation over exponentially long times for new
splitting methods which dampen highly oscillatory terms of the linear part of the
Hamiltonian evolution equation.

Cohen et al. [4] use modulated Fourier expansions to prove approximate
conservation of energy and of the actions for semilinear wave equations over polynomially
long times $O(1/h^k)$ with small smooth initial data and full space-time
discretizations which are symplectic trigonometric integrators in time and spectral space
discretizations. They do not obtain exponentially accurate estimates; but,
assuming a non-resonance condition which we do not require, they obtain boundedness
of the numerical solution over polynomially long times, using the method of
modulated Fourier expansions. A similar result for nonlinear Schrödinger equations is
obtained by Gauckler and Lubich [12].

In the case of nonlinear ordinary differential equations, a numerical method $\psi^h$
conserves a modified Hamiltonian in general only up to an exponentially small
error [15]; error bounds over exponentially long times require the assumption of
boundedness of the numerical solution in a suitable norm, which is in general not
satisfied for nonlinear PDEs. So the analysis of approximate energy conservation
of symplectic discretizations of PDEs entails two problems: one is the construction
of a modified Hamiltonian which is conserved by the numerical method with an
error which is of high order for sufficiently smooth initial data (in this paper we are
only interested in exponentially small errors); the second problem is to ensure that
there exist numerical trajectories which remain bounded in the required norms
over exponentially long times (see also the discussion above). We decouple the
question of long-time existence of numerical trajectories from the existence of a modified
Hamiltonian. Although this paper primarily addresses the latter question, we think
it is conceivable to obtain Nekhoroshev type estimates which imply existence of
Gevrey regular numerical trajectories of Runge-Kutta semidiscretizations in time
of semilinear Schrödinger equations over exponentially long times. Such results have
been obtained for the continuous problem in [10]. It might also be possible to obtain
existence results for Gevrey-regular finite-dimensional invariant tori of numerical
methods, the existence of which has been obtained by Kuksin and Pöschel [30] and
Pöschel [31]. Combined with our results, this would imply approximate energy
conservation of A-stable Runge-Kutta semidiscretizations in time over exponentially
long times.

The method we use is related to techniques for averaging of rapidly forced
Hamiltonian PDEs of Matthies and Scheel [23]. The analysis relies on our
earlier work [27] [28] on A-stable Runge-Kutta methods for semilinear evolution
equations. In [27], we analyzed the differentiability properties in the initial value
and in time of the semiflow of

$$\partial_t U = F(U) = AU + B(U) \tag{1.1}$$

on a scale of Banach spaces, and obtained analogous results for the time-$h$ map of
its corresponding A-stable Runge-Kutta time-semidiscretization. In [28], we prove
stability of the semiflow and of the time-semidiscrete solution under spatial spectral
Galerkin approximation.

The paper is structured as follows. In Section 2 we review backward error
analysis for symplectic discretizations of Hamiltonian ODEs. In Section 3 we introduce
in more detail the class of semilinear Hamiltonian PDEs for which we carry out
our error analysis and show that the semilinear wave equation and the nonlinear
Schrödinger equation belong to this class. In Section 4 we introduce A-stable
symplectic Runge-Kutta methods which are well-defined on Hilbert spaces. In
Section 5 we present and prove our main result, Theorem 5.1, on approximate
energy conservation of symplectic A-stable Runge-Kutta methods, as described in
Section 2, applied to the class of PDEs introduced in Section 3.

2. Backward error analysis for Hamiltonian ODEs 

In the following, we review the method of backward error analysis for symplectic
Runge-Kutta methods applied to Hamiltonian ordinary differential equations. We
largely follow the notation and presentation of [15], but make explicit the
dependence of the estimates on the magnitude of the vector field.

Throughout the paper, we write

$$B_R^{\mathcal X}(U^0) = \{ U \in \mathcal X : \|U - U^0\|_{\mathcal X} \le R \}$$

to denote the closed ball of radius $R$ in a Banach space $\mathcal X$ about $U^0 \in \mathcal X$. When
no confusion about the space is possible, we may drop the superscript $\mathcal X$.

2.1. Implicit Runge-Kutta methods for ODEs. Consider an autonomous
ordinary differential equation

$$\dot y = f(y) \tag{2.1}$$

defined on the closed ball $B_r(y^0) \subset \mathbb R^m$ of radius $r$ around $y^0$, where $f$ is analytic
and satisfies the estimate

$$\|f(y)\| \le M \quad \text{for } y \in B_r(y^0).$$

Let $\phi^t : B_r(y^0) \to \mathbb R^m$ denote the flow of (2.1), i.e., $y(t) = \phi^t(y^0)$ solves (2.1) with
$y(0) = y^0$, and let $\psi^h : B_r(y^0) \to \mathbb R^m$ denote a one-step method applied to (2.1).
As shown in [15, Theorem IX.7.2] (with $r$ replaced by $2r$) via Cauchy estimates,
the $s$-stage Runge-Kutta method

$$w^i = y^0 + h \sum_{j=1}^{s} a_{ij} f(w^j) \quad \text{for } i = 1, \dots, s, \tag{2.2a}$$

$$\psi^h(y^0) = y^0 + h \sum_{i=1}^{s} b_i f(w^i) \tag{2.2b}$$

can be expanded in a converging power series in $h$,

$$\psi^h(y) = y + \sum_{j=1}^{\infty} h^j g_j(y). \tag{2.3}$$

For future reference, we define the method-dependent constants

$$\eta = 2 \max\bigl(\|a\|, \|b\|/(2\ln 2 - 1)\bigr) \quad \text{and} \quad \gamma = e\,(2 + 1.65\,\eta + \|b\|),$$

where

$$\|b\| = \sum_{i=1}^{s} |b_i| \quad \text{and} \quad \|a\| = \max_{i=1,\dots,s} \sum_{j=1}^{s} |a_{ij}|.$$
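The constants above are easy to evaluate for a concrete tableau. The following is a minimal sketch in Python, assuming the standard coefficients of the implicit midpoint rule and of the two-stage Gauss-Legendre method; the function names `rk_norms` and `method_constants` are hypothetical and serve only this illustration.

```python
import math


def rk_norms(a, b):
    """Return (||a||, ||b||): maximal absolute row sum of a, absolute sum of b."""
    norm_a = max(sum(abs(x) for x in row) for row in a)
    norm_b = sum(abs(x) for x in b)
    return norm_a, norm_b


def method_constants(a, b):
    """Return (eta, gamma) as defined in the text for the tableau (a, b)."""
    norm_a, norm_b = rk_norms(a, b)
    eta = 2.0 * max(norm_a, norm_b / (2.0 * math.log(2.0) - 1.0))
    gamma = math.e * (2.0 + 1.65 * eta + norm_b)
    return eta, gamma


# Implicit midpoint rule: s = 1, a_11 = 1/2, b_1 = 1.
MIDPOINT_A = [[0.5]]
MIDPOINT_B = [1.0]

# Two-stage Gauss-Legendre method (standard coefficients, order 4).
SQRT3 = math.sqrt(3.0)
GAUSS2_A = [[0.25, 0.25 - SQRT3 / 6.0], [0.25 + SQRT3 / 6.0, 0.25]]
GAUSS2_B = [0.5, 0.5]

eta_mid, gamma_mid = method_constants(MIDPOINT_A, MIDPOINT_B)
eta_g2, gamma_g2 = method_constants(GAUSS2_A, GAUSS2_B)
```

For both tableaux the maximum in the definition of $\eta$ is attained by the term involving $\|b\|$, so the two methods share the same $\eta$ and $\gamma$ in this sketch.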

2.2. Embedding of the method into a flow. The key result on which backward
error analysis of such one-step methods is based is the embedding of the numerical
time-$h$ map into the flow of a modified vector field. The precise statement is the
following.

Theorem 2.1 ([15, Theorem IX.7.6]). In the notation of Section 2.1, let $\psi^h$ be
an $s$-stage Runge-Kutta method applied to (2.1). Then for every $h \in [0, h_0/4]$ with
$h_0 = r/(2e\eta M)$ there exists a modified differential equation $\dot y = \tilde f(y)$, defined on
$B_{r/4}(y^0)$, whose flow $\tilde\phi^t$ satisfies $\tilde\phi^t(y^0) \in B_{r/4}(y^0)$ for at least $0 \le t \le h$ and

$$\|\psi^h(y^0) - \tilde\phi^h(y^0)\| \le h\,\gamma M\, e^{-h_0/h}.$$

The proof shall not be repeated in detail here. But we note, for later reference,
that the modified vector field is constructed as a power series in $h$,

$$\tilde f^n(y; h) = f(y) + \sum_{j=p}^{n-1} h^j f^{j+1}(y), \tag{2.4}$$

where $p$ is the order of the numerical method. Its exponential map is then expanded
in powers of $h$ and matched term by term with the expansion of the numerical
time-$h$ map (2.3). This yields a recursive expression for the coefficient vector fields
[15, Lemma IX.7.3]:

$$f^j(y) = g_j(y) - \sum_{i=2}^{j} \frac{1}{i!} \sum_{k_1 + \dots + k_i = j} \bigl(D_{k_1} \cdots D_{k_{i-1}} f^{k_i}\bigr)(y), \tag{2.5}$$

for $j \ge 2$, where $k_i \ge 1$ for all $i$, and $D_i g(y) = \mathrm{D}g(y) f^i(y)$ is a shorthand notation
for the Lie derivative with respect to the $i$th coefficient vector field. The proof of
Theorem 2.1 proceeds by carefully estimating the growth of the $f^j$, noting that the
optimal truncation is achieved when $n = n(h) = \lfloor h_0/h \rfloor$. (For $r \in \mathbb R$, the number
$n = \lfloor r \rfloor$ denotes the largest integer $n \le r$.) When referring to the optimally
truncated vector field, we write $\tilde f_h$ or just $\tilde f$.
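To get a feeling for the quantities in Theorem 2.1, the threshold $h_0$, the optimal truncation index $n(h)$, and the right-hand side of the embedding estimate can be evaluated directly. The sketch below is an illustration only; the function names are hypothetical, and the values of $r$, $M$, $\eta$, $\gamma$ must be supplied by the user.

```python
import math


def step_size_threshold(r, M, eta):
    """h_0 = r / (2 e eta M), as in Theorem 2.1."""
    return r / (2.0 * math.e * eta * M)


def optimal_truncation_index(h, h0):
    """n(h) = floor(h_0 / h), the largest integer not exceeding h_0 / h."""
    return math.floor(h0 / h)


def embedding_error_bound(h, h0, M, gamma):
    """Right-hand side h * gamma * M * exp(-h_0 / h) of the embedding estimate."""
    if not 0.0 < h <= h0 / 4.0:
        raise ValueError("the estimate is stated for 0 < h <= h_0 / 4")
    return h * gamma * M * math.exp(-h0 / h)
```

Halving the step size roughly doubles $n(h)$ and squares the exponential factor $e^{-h_0/h}$, which is the mechanism behind the exponentially small error.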
One crucial estimate, as proved in [15], reads

$$\|\tilde f(y)\| \le (1 + 1.65\,\eta)\, M \quad \text{for } y \in B_{r/4}(y^0). \tag{2.6}$$

A related estimate, which guarantees consistency of the truncation, is the following.
We provide an explicit proof as the dependence of the estimate on $M$ and $r$ is not
spelled out in the literature in the detail we require.

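Lemma 2.2. In the notation of Section 2.1, let $p \ge 1$ and $0 \le h \le h_0/4$. Then
there exists a constant $c = c(\eta, p)$ which only depends on the method such that

$$\|\tilde f(y) - f(y)\| \le c\, r^{-p} M^{p+1} h^p \quad \text{for } y \in B_{r/4}(y^0).$$

Proof. It is known [15, Theorem IX.7.5] that

$$\|f^j(y)\| \le \ln(2)\, \eta M \left( \frac{2 \eta M j}{r} \right)^{j-1} \quad \text{for } y \in B_{r/4}(y^0). \tag{2.7}$$

Applying this estimate to (2.4) and using that $n \le h_0/h$ and therefore $h \le h_0/n$,
we find that

$$
\begin{aligned}
\|\tilde f(y) - f(y)\|
&\le h^p \sum_{j=p}^{n-1} h^{j-p}\, \|f^{j+1}(y)\| \\
&\le h^p \sum_{j=p}^{n-1} h^{j-p} \ln(2)\, \eta M \left( \frac{2 \eta M (j+1)}{r} \right)^{j} \\
&\le h^p \ln(2)\, \eta M \sum_{j=p}^{n-1} \left( \frac{r}{2 e \eta M n} \right)^{j-p} \left( \frac{2 \eta M (j+1)}{r} \right)^{j} \\
&= h^p \ln(2)\, \eta M \left( \frac{2 \eta M}{r} \right)^{p} \sum_{j=p}^{n-1} \left( \frac{j+1}{n} \right)^{j-p} \frac{(j+1)^p}{e^{j-p}} \\
&\le \ln(2)\, \eta M \left( \frac{2 \eta M h}{r} \right)^{p} e^{p+1}\, p! ,
\end{aligned}
\tag{2.8}
$$

where, in the last inequality, we have bounded the first factor inside the sum by 1
and noted that $x^p e^{-x}$ is decreasing for $x \ge p$, so that

$$\sum_{j=p}^{n-1} \frac{(j+1)^p}{e^{j+1}} \le \int_p^\infty x^p e^{-x}\, dx \le \int_0^\infty x^p e^{-x}\, dx = p! .$$

This completes the proof. □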
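The elementary sum-versus-integral comparison in the last step of the proof can be checked numerically. The following Python sketch does so for a few values of $p$ and $n$; the helper `lemma_constant` evaluates the constant $c(\eta, p) = \ln(2)\,\eta\,(2\eta)^p e^{p+1} p!$ as read off from (2.8), which is an assumption of this illustration rather than a formula stated explicitly in the text.

```python
import math


def tail_sum(p, n):
    """Sum of (j+1)^p * exp(-(j+1)) over j = p, ..., n-1."""
    return sum((j + 1) ** p * math.exp(-(j + 1)) for j in range(p, n))


def lemma_constant(eta, p):
    """Constant c(eta, p) read off from (2.8); an assumption of this sketch."""
    return math.log(2.0) * eta * (2.0 * eta) ** p * math.e ** (p + 1) * math.factorial(p)


def consistency_bound(eta, p, r, M, h):
    """Right-hand side c * r^(-p) * M^(p+1) * h^p of Lemma 2.2."""
    return lemma_constant(eta, p) * r ** (-p) * M ** (p + 1) * h ** p


# Check the comparison sum <= p! for several orders and truncation indices.
checks = [(p, n, tail_sum(p, n) <= math.factorial(p))
          for p in range(1, 9) for n in (p + 1, 50, 500)]
```

The factorial growth of $c(\eta, p)$ in $p$ is harmless here because $p$, the order of the method, is fixed.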

2.3. Modified Hamiltonian. To proceed further, we assume that the differential
equation is Hamiltonian, i.e., of the form

$$\dot y = f_H(y) = \mathbb J \nabla H(y) \tag{2.9}$$

with $m$ even, where $\mathbb J$ is a skew-symmetric, invertible $m \times m$ matrix, the symplectic
structure matrix. The function $H : \mathbb R^m \to \mathbb R$ is the Hamiltonian or energy of the
system and is an invariant of the motion. Hamiltonian systems have a symplectic
flow map, i.e., $\phi^t$ satisfies

$$(\mathrm{D}_y \phi^t(y))^T\, \mathbb J^{-1}\, \mathrm{D}_y \phi^t(y) = \mathbb J^{-1} \tag{2.10}$$

for all $y$ and $t$ for which this relation is well defined [21]. A numerical one-step
method is called symplectic if, when applied to a Hamiltonian ODE, its time-$h$
map $\psi^h$ is symplectic.

It is known that a Runge-Kutta method of the form (2.2) is symplectic if its
coefficients satisfy

$$b_i a_{ij} + b_j a_{ji} - b_i b_j = 0$$

for $i, j = 1, \dots, s$; see, for example, [15]. The simplest example of a symplectic
Runge-Kutta method is the implicit midpoint rule, given by

$$y^1 = y^0 + h\, f\bigl(\tfrac12 (y^0 + y^1)\bigr),$$

which, equivalently, can be written in the form of a general Runge-Kutta scheme
(2.2) with $s = 1$, $a_{11} = \tfrac12$, and $b_1 = 1$.
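The algebraic symplecticity condition is straightforward to verify for a given tableau. The sketch below, in Python, evaluates the largest violation of the condition; the function name `symplecticity_defect` is hypothetical, and the Gauss-Legendre and explicit Euler coefficients are the standard ones, supplied here as assumptions of the example.

```python
import math


def symplecticity_defect(a, b):
    """Largest |b_i a_ij + b_j a_ji - b_i b_j| over all stage pairs (i, j)."""
    s = len(b)
    return max(
        abs(b[i] * a[i][j] + b[j] * a[j][i] - b[i] * b[j])
        for i in range(s)
        for j in range(s)
    )


# Implicit midpoint rule: s = 1, a_11 = 1/2, b_1 = 1.
defect_midpoint = symplecticity_defect([[0.5]], [1.0])

# Two-stage Gauss-Legendre method (standard coefficients).
SQRT3 = math.sqrt(3.0)
defect_gauss2 = symplecticity_defect(
    [[0.25, 0.25 - SQRT3 / 6.0], [0.25 + SQRT3 / 6.0, 0.25]], [0.5, 0.5]
)

# Explicit Euler (a_11 = 0, b_1 = 1) violates the condition.
defect_euler = symplecticity_defect([[0.0]], [1.0])
```

A defect of zero (up to rounding) indicates that the sufficient condition for symplecticity holds; explicit Euler is included as a contrasting case.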

For symplectic methods it can then be shown that the modified vector field
is also Hamiltonian [14] [15] [TH [33]. More specifically, $\mathbb J^{-1} \mathrm{D} f^j(y)$ is a symmetric
matrix for all $j > p$ and $y \in B_{r/4}(y^0)$, so that we can obtain a Hamiltonian for the
vector field $f^j$ via

$$H^j(y) = \int_0^1 (y - y^0)^T\, \mathbb J^{-1} f^j\bigl(y^0 + t (y - y^0)\bigr)\, dt \tag{2.11}$$

for $j > p$ and $\|y - y^0\| \le r/4$, i.e., $f^j = \mathbb J \nabla H^j$. Then the optimally truncated
modified Hamiltonian reads

$$\tilde H^n(\,\cdot\,; h) = H + \sum_{j=p}^{n-1} h^j H^{j+1}, \tag{2.12}$$

so that

$$\tilde f = \mathbb J \nabla \tilde H.$$

(Again, we refer to the optimal truncation by dropping the superscript of a general
modified Hamiltonian $\tilde H^n$.) This construction implies the $O(h^p)$-closeness of the
modified to the original Hamiltonian. On the other hand, the modified Hamiltonian
is exactly conserved on the modified flow, which is exponentially close to the discrete
time-$h$ map. This proves approximate energy conservation of symplectic numerical
methods.

Theorem 2.3 ([15, Theorem IX.8.1]). Consider a Hamiltonian system (2.9) on
some open, simply connected set $D \subset \mathbb R^{2m}$ with analytic Hamiltonian $H : D \to \mathbb R$
and apply a symplectic Runge-Kutta method $\psi^h$ with step size $h$. So long as the
numerical solution $y^n = (\psi^h)^n(y^0)$ remains in some compact set $K \subset D$ then, for
$h_0$ and $\tilde H$ as above,

$$\tilde H(y^n) = \tilde H(y^0) + O(e^{-h_0/2h})$$

over the exponentially long interval of time $nh \le e^{h_0/2h}$.

Together, (2.12) and Theorem 2.3 imply that the numerical method possesses an
$O(h^p)$-approximate energy invariant which is preserved over time scales which are
exponentially long in $1/h$.
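The qualitative content of this statement can be observed in a small experiment. The following Python sketch applies the implicit midpoint rule to the mathematical pendulum with $H(q, p) = p^2/2 - \cos q$ and records the largest deviation of the original energy along the discrete trajectory. The test problem, the fixed-point solver for the implicit stage, and all function names are assumptions chosen for this illustration; they are not taken from the text.

```python
import math


def pendulum_field(q, p):
    """Hamiltonian vector field of H(q, p) = p^2/2 - cos(q)."""
    return p, -math.sin(q)


def pendulum_energy(q, p):
    return 0.5 * p * p - math.cos(q)


def implicit_midpoint_step(q, p, h, tol=1e-14, max_iter=100):
    """One step y1 = y0 + h f((y0 + y1)/2), solved by fixed-point iteration."""
    q_new, p_new = q, p
    for _ in range(max_iter):
        fq, fp = pendulum_field(0.5 * (q + q_new), 0.5 * (p + p_new))
        q_next, p_next = q + h * fq, p + h * fp
        converged = abs(q_next - q_new) + abs(p_next - p_new) < tol
        q_new, p_new = q_next, p_next
        if converged:
            break
    return q_new, p_new


def max_energy_drift(q0, p0, h, steps):
    """Largest |H(y^n) - H(y^0)| over the first `steps` steps."""
    e0 = pendulum_energy(q0, p0)
    q, p = q0, p0
    drift = 0.0
    for _ in range(steps):
        q, p = implicit_midpoint_step(q, p, h)
        drift = max(drift, abs(pendulum_energy(q, p) - e0))
    return drift


# Same final time for both runs; the midpoint rule has order p = 2.
drift_h = max_energy_drift(1.0, 0.0, 0.1, 5000)
drift_half = max_energy_drift(1.0, 0.0, 0.05, 10000)
```

The deviation of $H$ stays bounded rather than growing with the number of steps, and it shrinks when the step size is halved, consistent with an $O(h^p)$-close modified energy being nearly conserved.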

In this paper we prove an analogue of Theorem 2.3 for a general class of
semilinear Hamiltonian evolution equations, including nonlinear Schrödinger
equations and semilinear wave equations on bounded star-shaped domains. The main
problem we face is that the assumptions of Section 2.1 cannot be satisfied with a
finite value of $M$. We will address this problem in Section 5.
3. Semilinear Hamiltonian PDEs

We now provide the framework necessary to extend the results of the preceding
section to the case of semilinear Hamiltonian systems on Hilbert spaces. We begin
by reviewing the general functional setting for semilinear evolution equations. In
Section 3.2 we review differentiability in time of the semiflow. In Section 3.3 we
restrict to the Hamiltonian case and review a well-known integrability lemma in
our Hilbert space setting. Section 3.4 introduces Hilbert spaces of analytic
functions and superposition operators on these spaces. Finally, in Sections 3.5 and 3.6,
respectively, we show how our main examples, the semilinear wave equation
and the nonlinear Schrödinger equation, fit into this framework.

3.1. Semilinear evolution equations. We initially consider an abstract
semilinear evolution equation of the form (1.1),

$$\partial_t U = F(U) = AU + B(U),$$

which formally resembles (2.1) but will be thought of as being posed on an
infinite-dimensional Hilbert space $\mathcal Y$. To make this distinction apparent, we use upper case
letters for phase space variables in infinite dimensions and lower case letters for
phase space variables in finite dimensions. We assume the following.

(A) $A$ is a normal operator on a Hilbert space $\mathcal Y$ which generates a $C^0$-semigroup
$e^{tA}$.

(B0) $B : \mathcal D \to \mathcal Y$ is analytic on an open set $\mathcal D \subset \mathcal Y$.

Recall that an operator $A$ is normal if it is closed and $AA^* = A^*A$. For a definition
of strongly continuous semigroups ($C^0$-semigroups), see [29].

Condition (A) implies that there is a constant $\omega \ge 0$ such that $\|e^{tA}\| \le e^{\omega t}$ for
all $t \ge 0$. Then, after casting (1.1) in its mild formulation

$$U(t) = e^{tA} U^0 + \int_0^t e^{(t-s)A} B(U(s))\, ds, \tag{3.1}$$

we can apply the contraction mapping theorem with parameters to obtain
well-posedness locally in time [HI [29]. Let $U^0 \mapsto \Phi^t(U^0)$ denote the flow of (3.1), i.e.,
$U(t) = \Phi^t(U^0)$ satisfies (3.1) with $U(0) = U^0$.
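The contraction mapping argument behind the mild formulation can be illustrated on a scalar toy problem. The Python sketch below performs Picard iteration on a discretized version of (3.1) for $a = -1$ and $B(u) = u^2$; the scalar setting, the trapezoidal quadrature, and the function name `picard_mild_solution` are all assumptions of this illustration and play no role in the analysis.

```python
import math


def picard_mild_solution(a, nonlinearity, u0, t_end, n_grid=100, n_iter=25):
    """Picard iteration for U(t) = e^{ta} u0 + int_0^t e^{(t-s)a} B(U(s)) ds.

    Scalar toy version; the integral is approximated by the trapezoidal rule
    on a uniform grid with n_grid subintervals.
    """
    dt = t_end / n_grid
    times = [i * dt for i in range(n_grid + 1)]
    # Initial guess: the linear flow applied to the initial datum.
    u = [math.exp(a * t) * u0 for t in times]
    for _ in range(n_iter):
        b_vals = [nonlinearity(v) for v in u]
        new_u = []
        for i, t in enumerate(times):
            integral = 0.0
            for k in range(i):
                left = math.exp(a * (t - times[k])) * b_vals[k]
                right = math.exp(a * (t - times[k + 1])) * b_vals[k + 1]
                integral += 0.5 * dt * (left + right)
            new_u.append(math.exp(a * t) * u0 + integral)
        u = new_u
    return times, u


# u' = -u + u^2 with u(0) = 1/2 has the exact solution u(t) = 1 / (1 + e^t).
times, approx = picard_mild_solution(-1.0, lambda v: v * v, 0.5, 1.0)
max_error = max(abs(v - 1.0 / (1.0 + math.exp(t))) for t, v in zip(times, approx))
```

On a short time interval the iteration converges rapidly, so the remaining error is dominated by the quadrature rather than by the fixed-point iteration.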

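For $m \in \mathbb N$, let $\mathbb P_m$ denote the sequence of spectral projectors of $A$ onto the set
$\operatorname{spec} A \cap B_m^{\mathbb C}(0)$, set $\mathbb P = \mathbb P_1$, and $\mathbb Q = 1 - \mathbb P$. Assumption (A) implies that

$$\lim_{m \to \infty} \mathbb P_m U = U$$

for all $U \in \mathcal Y$, and

$$\|A \mathbb P_m U\|_{\mathcal Y} \le m\, \|\mathbb P_m U\|_{\mathcal Y} \tag{3.2}$$

for $m \in \mathbb N$. Let $q > 0$, $\tau \ge 0$ and $\ell \in \mathbb N_0$. Since $A$ is normal, $|\mathbb Q A|^\ell \exp(\tau |\mathbb Q A|^{1/q})$ is
a well-defined, generally unbounded and densely defined operator on $\mathcal Y$. We may
thus introduce the abstract Gevrey space

$$\mathcal Y_{\tau,\ell} = \mathcal Y_{\tau,\ell,q} = D\bigl(|\mathbb Q A|^\ell \exp(\tau |\mathbb Q A|^{1/q})\bigr) \tag{3.3}$$

equipped with the scalar product

$$\langle U_1, U_2 \rangle_{\mathcal Y_{\tau,\ell}} = \langle \mathbb P U_1, \mathbb P U_2 \rangle_{\mathcal Y}
+ \bigl\langle |\mathbb Q A|^\ell \exp(\tau |\mathbb Q A|^{1/q})\, \mathbb Q U_1,\; |\mathbb Q A|^\ell \exp(\tau |\mathbb Q A|^{1/q})\, \mathbb Q U_2 \bigr\rangle_{\mathcal Y}. \tag{3.4}$$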
Gevrey-smooth functions $U \in \mathcal Y_{\tau,\ell}$ are exponentially well approximated by their
Galerkin projections $\mathbb P_m U$. Indeed, setting $\mathbb Q_m = \mathrm{id} - \mathbb P_m$,

$$\|\mathbb Q_m U\|_{\mathcal Y} \le m^{-\ell} \exp(-\tau m^{1/q})\, \|U\|_{\mathcal Y_{\tau,\ell}}. \tag{3.5}$$
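Estimate (3.5) can be illustrated in a diagonal model. The Python sketch below assumes that $A$ acts on Fourier coefficients $u_k$, $k \in \mathbb Z$, with eigenvalues $ik$ (as for $\partial_x$ on the circle), so that $\mathbb P_m$ retains the modes with $|k| \le m$; this model, the sample coefficients $u_k = e^{-2|k|}$, and the function names are assumptions made for the example only.

```python
import math


def gevrey_norm(coeffs, tau, ell, q):
    """Norm induced by (3.4) in the diagonal model, with P = P_1."""
    total = 0.0
    for k, u in coeffs.items():
        lam = abs(k)
        if lam <= 1:
            total += abs(u) ** 2
        else:
            weight = lam ** ell * math.exp(tau * lam ** (1.0 / q))
            total += (weight * abs(u)) ** 2
    return math.sqrt(total)


def tail_norm(coeffs, m):
    """Norm of Q_m U, i.e. of the modes with |k| > m."""
    return math.sqrt(sum(abs(u) ** 2 for k, u in coeffs.items() if abs(k) > m))


def tail_bound(coeffs, m, tau, ell, q):
    """Right-hand side of (3.5)."""
    return m ** (-ell) * math.exp(-tau * m ** (1.0 / q)) * gevrey_norm(coeffs, tau, ell, q)


# Sample data: exponentially decaying coefficients, truncated at |k| = 60.
coeffs = {k: math.exp(-2.0 * abs(k)) for k in range(-60, 61)}
tau, ell, q = 1.0, 1, 1.0
comparison = [(m, tail_norm(coeffs, m), tail_bound(coeffs, m, tau, ell, q))
              for m in range(1, 21)]
```

In this model the Galerkin tail decays at least as fast as the weight $m^{-\ell} e^{-\tau m^{1/q}}$, which is what allows the number of retained modes to be coupled to the step size later on.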

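Moreover, this definition of the norm ensures that

$$\|\mathbb P\|_{\mathcal Y \to \mathcal Y_{\tau,\ell}} \le 1 \quad \text{and} \quad \|U\|_{\mathcal Y_{\tau,\ell}} \le \|U\|_{\mathcal Y_{\tau,\ell+1}} \tag{3.6}$$

for all $U \in \mathcal Y_{\tau,\ell+1}$. For convenience, we define $\mathcal Y_\ell = \mathcal Y_{0,\ell}$. We can then state the
following lemma.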

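Lemma 3.1. For $\sigma > \tau$ and $p \in \mathbb N_0$,

Proof. For $p = 0$, there is nothing to prove, hence let $p > 0$. For fixed $U \in \mathcal Y_{\sigma,\ell}$,

$$\|U\|^2_{\mathcal Y_{\tau,\ell+p}} = \|\mathbb P U\|^2_{\mathcal Y} + \bigl\| |\mathbb Q A|^p e^{(\tau - \sigma) |\mathbb Q A|^{1/q}}\, |\mathbb Q A|^\ell e^{\sigma |\mathbb Q A|^{1/q}}\, \mathbb Q U \bigr\|^2_{\mathcal Y}. \tag{3.8}$$

The function $f(\lambda) = |\lambda|^p e^{(\tau - \sigma) |\lambda|^{1/q}}$ is non-negative and has a global maximum at
$\lambda_* = (pq/(\sigma - \tau))^q$. Replacing the corresponding term in (3.8) by its maximum
value, we obtain (3.7). □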
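The maximum used in the last step of the proof can be worked out explicitly. The following is a sketch of that elementary computation, included for illustration only; the resulting constant is what this calculation yields for the scalar function $f$ and should be compared with the lemma as stated in the original.

```latex
% For \lambda > 0 and \sigma > \tau, write f(\lambda) = \lambda^p e^{-(\sigma-\tau)\lambda^{1/q}}.
\[
  \frac{d}{d\lambda} \ln f(\lambda)
  = \frac{p}{\lambda} - \frac{\sigma - \tau}{q}\, \lambda^{1/q - 1} = 0
  \quad\Longleftrightarrow\quad
  \lambda_*^{1/q} = \frac{pq}{\sigma - \tau} ,
\]
% so that the maximal value is
\[
  f(\lambda_*)
  = \left( \frac{pq}{\sigma - \tau} \right)^{pq} e^{-pq}
  = \left( \frac{pq}{e\,(\sigma - \tau)} \right)^{pq} .
\]
```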

3.2. Differentiability of the semiflow. To obtain a flow of the evolution
equation which has higher order time derivatives, as required in Section 5, we need more
specific assumptions on the regularity of $B$ on the scale of Hilbert spaces defined
above.

We use the following convention to denote derivatives. Given any Hilbert space
$\mathcal X$, open set $\mathcal D \subset \mathcal X$, and map $Z : \mathcal D \to \mathbb R$, we write $\mathrm{D}Z(U)$ to denote the derivative
of $Z$ at $U \in \mathcal D$ as an element of $\mathcal X^*$, and by $\nabla Z(U)$ the canonical representation of
$\mathrm{D}Z(U)$ by an element of $\mathcal X$. In other words, $\mathrm{D}Z(U) W = \langle \nabla Z(U), W \rangle$, where $\langle \cdot, \cdot \rangle$
denotes the inner product on $\mathcal X$.

For Banach spaces $\mathcal X$ and $\mathcal Z$, and $j \in \mathbb N_0$, we write $\mathcal E^j(\mathcal X, \mathcal Z)$ to denote the vector
space of $j$-multilinear bounded mappings from $\mathcal X$ to $\mathcal Z$; we set $\mathcal E^j(\mathcal X) = \mathcal E^j(\mathcal X, \mathcal X)$.
Moreover, when $\mathcal U \subset \mathcal X$ is open and $k \in \mathbb N$, we write $C_b^k(\mathcal U; \mathcal Z)$ to denote the set of
$k$ times continuously differentiable functions $F : \mathcal U \to \mathcal Z$ whose derivatives $\mathrm{D}^i F$ are
bounded as maps from $\mathcal U$ to $\mathcal E^i(\mathcal X, \mathcal Z)$ and extend to the boundary of $\mathcal U$.

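Finally, for Banach spaces $\mathcal X$, $\mathcal Y$, and $\mathcal Z$, and open subsets $\mathcal U \subset \mathcal X$, $\mathcal V \subset \mathcal Y$, and
$\mathcal W \subset \mathcal Z$, we write

$$F \in C_b^{(m,n)}(\mathcal U \times \mathcal V; \mathcal W)$$

to denote a continuous, bounded function $F : \mathcal U \times \mathcal V \to \mathcal W$ whose partial Fréchet
derivatives $\mathrm{D}_X^i \mathrm{D}_Y^j F(X, Y)$ exist, are bounded, and are such that the maps

$$(X, Y, X_1, \dots, X_i) \mapsto \mathrm{D}_X^i \mathrm{D}_Y^j F(X, Y)(X_1, \dots, X_i)$$

are continuous from $\mathcal U \times \mathcal V \times \mathcal X^i$ into $\mathcal E^j(\mathcal Y, \mathcal Z)$ for $i = 0, \dots, m$ and $j = 0, \dots, n$,
and extend continuously to the boundary.

Given $\delta > 0$ and a family of open sets $\mathcal D_\ell \subset \mathcal Y_\ell$ for $\ell = 0, \dots, L$ with $L \in \mathbb N$, we
define

$$\mathcal D_\ell^{-\delta} \equiv \{ U \in \mathcal D_\ell : \operatorname{dist}_{\mathcal Y_\ell}(U, \partial \mathcal D_\ell) > \delta \}. \tag{3.9}$$

Then, by construction, $B_\delta^{\mathcal Y_\ell}(U) \subset \mathcal D_\ell$ for all $U \in \mathcal D_\ell^{-\delta}$ and $\ell = 0, \dots, L$.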
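Let $\mathcal Y_1$ be a Banach space continuously embedded into the Banach space $\mathcal Y$.
Then $\mathcal D_1 \subset \mathcal Y_1$ is called a $\delta_*$-nested subset of $\mathcal D \subset \mathcal Y$ if $\mathcal D_1^{-\delta} \subset \mathcal D^{-\delta}$ for all $\delta \in [0, \delta_*]$.
Furthermore, we say that the family $\mathcal D_0, \dots, \mathcal D_L$ is $\delta_*$-nested if $\mathcal D_\ell^{-\delta} \subset \mathcal D_{\ell-1}^{-\delta}$ for all
$\delta \in [0, \delta_*]$ with $\delta_* > 0$ and $\ell = 1, \dots, L$. For example, the family $\mathcal D_k = B_R^{\mathcal Y_k}(U^0)$ is
$\delta_*$-nested for every $\delta_* \in (0, R)$ and $U^0 \in \mathcal Y_L$. However, an arbitrary nested family
$\mathcal D_\ell \subset \mathcal D_{\ell-1}$ may not be $\delta_*$-nested for any $\delta_* > 0$.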

We now make the following assumption on the nonlinearity of our semilinear
evolution equation.

(B1) There exist $K \in \mathbb N_0$, $N \in \mathbb N$ with $N > K$, and a $\delta_*$-nested sequence of
$\mathcal Y_k$-bounded and open sets $\mathcal D_k$ such that $B \in C_b^{N-k}(\mathcal D_k; \mathcal Y_k)$ for $k = 0, \dots, K$.

We denote the bounds of the maps $B : \mathcal D_k \to \mathcal Y_k$ and their derivatives by
constants $M_k$, $M_k'$, etc., for $k = 0, \dots, K$. In addition to the domains $\mathcal D_0, \dots, \mathcal D_K$
defined in (B1), we will sometimes refer to $\mathcal D_{K+1}$, which may be any $\mathcal Y_{K+1}$-bounded,
open, and $\delta_*$-nested subset of $\mathcal D_K$, and define

$$R_{K+1} = \sup_{U \in \mathcal D_{K+1}} \|U\|_{\mathcal Y_{K+1}}. \tag{3.10}$$

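We can then quote the following theorem on the uniform regularity of the flow
[27, Theorem 2.6 and Remark 2.8].

Theorem 3.2 (Regularity of semiflow). Assume (A) and (B1). Choose $\delta \in (0, \delta_*]$
small enough such that $\mathcal D_{K+1}^{-\delta} \ne \emptyset$. Then there exists $T_* > 0$ such that the semiflow
$(U, t) \mapsto \Phi^t(U)$ of (1.1) satisfies

$$\Phi \in \bigcap_{\substack{j + k \le N \\ \ell \le k \le K+1}} C_b^{(j,\ell)}\bigl(\mathcal D_{K+1}^{-\delta} \times [0, T_*];\, \mathcal Y_{k-\ell}\bigr). \tag{3.11}$$

Moreover, $\Phi$ maps $\mathcal D_{K+1}^{-\delta} \times [0, T_*]$ into $\mathcal D_K$. The bounds on $\Phi$ and $T_*$ depend only
on the bounds afforded by (B1), (3.10), $\omega$, and on $\delta$.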
Remark 3.3. The precise form of assumption (B1) is motivated by the typical case
where $B$ is a superposition operator of a function $f : D \subset \mathbb R^d \to \mathbb R^m$ and $\mathcal Y_\ell$ is
related to the standard Sobolev space $\mathcal H_\ell$. Then, if $f$ is $(N + 1)$ times continuously
differentiable on some open set $D \subset \mathbb R^d$, it is $N$ times differentiable as a map from
the open set $\mathcal D$ of $\mathcal H_1$ to $\mathcal H_1$. Here $u \in \mathcal D$ ensures that $u(x) \in D$ pointwise. Moreover,
$f$ is $(N - k)$ times differentiable from $\mathcal D \cap \mathcal H_k$ to $\mathcal H_k$; see [37, Theorem 2.12] and
also [23, Remark 7.4].

For the results of Section 5, we need regularity of the flow on a space of
Gevrey-regular functions as well. Hence, we assume the following.

(B2) There exist $\tau > 0$, $q > 0$, $L \ge 0$, and a $\mathcal Y_{\tau,L}$-bounded open set $\mathcal D_{\tau,L}$ which
is a $\delta_*$-nested subset of $\mathcal D_{K+1}$ such that $B \in C_b^1(\mathcal D_{\tau,L}; \mathcal Y_{\tau,L})$.

We note that $\mathcal Y_{\tau,L} \subset \mathcal Y_{K+1}$ due to Lemma 3.1. In the following, $\mathcal D_{\tau,L+1}$ refers
to an arbitrary $\mathcal Y_{\tau,L+1}$-bounded, open, and $\delta_*$-nested subset of $\mathcal D_{\tau,L}$. We note that
under assumption (B2), Theorem 3.2 applies with $\mathcal Y_{\tau,L}$ in place of $\mathcal Y$. For examples,
see Sections 3.5 and 3.6.

3.3. Hamiltonian structures on Hilbert spaces. Our main result, the
backward error analysis of Section 5, requires that (1.1) is Hamiltonian, i.e., there exists
a symplectic structure operator $\mathbb J$ and a Hamiltonian $H : \mathcal D \to \mathbb R$ such that

$$\partial_t U = AU + B(U) = \mathbb J \nabla H(U). \tag{3.12}$$

In addition to (A) and (B0), we assume the following.

(H0) The symplectic structure operator $\mathbb J$ is a closed, skew-symmetric, densely
defined, and bijective linear operator on $\mathcal Y$.
(H1) $A$ is skew-symmetric and $\mathbb J^{-1} A$ is bounded and self-adjoint on $\mathcal Y$.
(H2) For every $U \in \mathcal D$, the operator $\mathbb J^{-1} \mathrm{D}B(U)$ is self-adjoint on $\mathcal Y$.
(H3) $\mathcal D$ is bounded and star-shaped.

We first remark that, by the closed graph theorem, (H0) implies that $\mathbb J$ is
invertible with $\mathbb J^{-1} \in \mathcal E(\mathcal Y)$. This implies, in particular, that $\mathbb J^{-1} \mathrm{D}B(U)$ is a bounded
operator on $\mathcal Y$ for every $U \in \mathcal D$.

Second, recall that an operator $A$ is skew if $A^* = -A$ and $D(A) = D(A^*)$. This
implies that $\operatorname{spec}(A) \subset i\mathbb R$ and that, by Stone's Theorem, $A$ generates a unitary
$C^0$-group on $\mathcal Y$; see, e.g., [32]. If $A = A_s + A_b$ where $A_s$ is skew and $A_b$ is bounded,
we can redefine $B$ as $B + A_b$ and $A$ as $A_s$ to satisfy (H1). This situation is typical
for semilinear wave equations, see Section 3.5.

Third, by conditions (A), (H0), and (H1),

$$\mathbb{J}^{-1}A = (\mathbb{J}^{-1}A)^* = A^*(\mathbb{J}^{-1})^* = -A(\mathbb{J}^{-1})^* = A\mathbb{J}^{-1}. \tag{3.13}$$

Hence, $A$ and $\mathbb{J}^{-1}$ commute, which also implies the following.

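Lemma 3.4. Assume (A), (H0), and (H1). Then $\mathbb{J}^{-1}\mathbb{P}_m = \mathbb{P}_m\mathbb{J}^{-1}$ for all $m \in \mathbb{N}_0$.

Proof. By (3.13), $A$ and $\mathbb{J}^{-1}$ commute, and so do $F(A)$ and $\mathbb{J}^{-1}$ for all analytic functions $F$. Approximating characteristic functions $\chi_\Lambda$ of measurable sets $\Lambda \subset \mathbb{C}$ by analytic functions, we see that $\chi_\Lambda(A)$ and $\mathbb{J}^{-1}$ also commute [32]. With $\Lambda = B^{\mathbb{C}}_m(0)$, this implies that $\chi_\Lambda(A) = \mathbb{P}_m$ commutes with $\mathbb{J}^{-1}$. □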

Finally, we recall that a subset $S$ of a linear space is star-shaped if there exists $U_* \in S$ such that for every $W \in S$, the line segment $\overline{U_*W}$ is contained in $S$. We then say that $S$ is star-shaped with respect to $U_*$; the set of all points with respect to which $S$ is star-shaped is denoted $\operatorname{ck} S$, the convex kernel of $S$.

The existence of a Hamiltonian $H$ is then guaranteed by the following integrability lemma.

Lemma 3.5. Assume (A), (B0) and (H0-3). Then there exists an analytic Hamiltonian $H : \mathcal{D} \to \mathbb{R}$ for the evolution equation (1.1). Moreover, if (B1) holds true,
thenH eCg(D;R). 

Proof. We seek a Hamiltonian of the form

$$H(U) = \tfrac12 \langle U, \mathbb{J}^{-1}AU \rangle + V(U). \tag{3.14}$$

Due to (A), (H0), and (H1), the quadratic part of the Hamiltonian is well-defined and possesses the properties claimed.

To proceed, we first suppose that $\mathcal{D}$ is star-shaped and fix $U^0 \in \operatorname{ck}\mathcal{D}$. We set

$$V(U) = \int_0^1 \langle \mathbb{J}^{-1}B(tU + (1-t)U^0), U - U^0 \rangle \, dt, \tag{3.15}$$
so that, for $W \in \mathcal{Y}$,

$$\begin{aligned}
\langle \nabla V(U), W \rangle
&= \int_0^1 \langle \mathbb{J}^{-1}B(tU + (1-t)U^0), W \rangle \, dt
 + \int_0^1 t \, \langle \mathbb{J}^{-1}\mathrm{D}B(tU + (1-t)U^0)W, U - U^0 \rangle \, dt \\
&= \int_0^1 \frac{d}{dt} \langle t \, \mathbb{J}^{-1}B(tU + (1-t)U^0), W \rangle \, dt,
\end{aligned}$$

where the last equality is due to the self-adjointness of $\mathbb{J}^{-1}\mathrm{D}B(U)$. Then, by the fundamental theorem of calculus, $B(U) = \mathbb{J}\nabla V(U)$. Further, (3.15) shows that analyticity of $B$ implies analyticity of $V$, and uniform bounds on $B$ imply corresponding uniform bounds on $V$. □
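
As a finite-dimensional sanity check of the line-integral construction in the proof of Lemma 3.5, the following sketch takes $\mathcal{Y} = \mathbb{R}^2$ with the canonical structure matrix and a sample potential. Both choices, as well as all function names, are illustrative assumptions and are not taken from the text. The integral is evaluated by the midpoint rule and should reproduce $V(U) - V(U^0)$, since the construction normalizes the potential to vanish at $U^0$.

```python
import math

def grad_V(q, p):
    # Gradient of the sample potential V(q, p) = cos(q) + p**4 / 4 (assumed example).
    return (-math.sin(q), p ** 3)

def B(q, p):
    # B = J grad V with the canonical matrix J = [[0, 1], [-1, 0]].
    gq, gp = grad_V(q, p)
    return (gp, -gq)

def J_inv(x, y):
    # Inverse of the canonical J: J^{-1} = [[0, -1], [1, 0]].
    return (-y, x)

def V_reconstructed(U, U0, n=2000):
    # Midpoint rule for int_0^1 <J^{-1} B(t U + (1 - t) U0), U - U0> dt.
    dq, dp = U[0] - U0[0], U[1] - U0[1]
    total = 0.0
    for i in range(n):
        t = (i + 0.5) / n
        q = t * U[0] + (1 - t) * U0[0]
        p = t * U[1] + (1 - t) * U0[1]
        x, y = J_inv(*B(q, p))
        total += x * dq + y * dp
    return total / n

def V_exact(q, p):
    return math.cos(q) + p ** 4 / 4

U, U0 = (0.7, -0.4), (0.0, 0.0)
print(V_reconstructed(U, U0), V_exact(*U) - V_exact(*U0))
```

The two printed numbers agree up to quadrature error, which illustrates that the integral recovers the potential up to an additive constant whenever $\mathbb{J}^{-1}\mathrm{D}B$ is symmetric.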

In the construction of the modified Hamiltonian in Section 5, we will need to shrink domains of definition. Therefore, we need conditions under which the domains $\mathcal{D}_k^{-\delta}$ remain star-shaped. We make use of the following characterization of a star-shaped subset $S$ of a linear space [31]. Let $\{C_\alpha : \alpha \in A\}$ denote the collection of all maximal convex subsets of $S$, which is nonempty. Then

$$\operatorname{ck} S = \bigcap_{\alpha \in A} C_\alpha \quad\text{and}\quad S = \bigcup_{\alpha \in A} C_\alpha.$$

We now prove a result which shows that so long as the convex kernel remains nonempty under domain shrinking, the reduced set remains star-shaped.

Theorem 3.6. Let $\mathcal{Y}$ be a Banach space and $\mathcal{D} \subset \mathcal{Y}$ star-shaped with $(\operatorname{ck}\mathcal{D})^{-\delta}$ nonempty. Then $\mathcal{D}^{-\delta}$ is star-shaped.

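Proof. Let $\{C_\alpha : \alpha \in A\}$ denote the collection of all maximal convex subsets of $\mathcal{D}$. Then $\{C_\alpha^{-\delta} : \alpha \in A\}$ is a collection of convex subsets of $\mathcal{D}^{-\delta}$. We claim that it is the collection of all maximal convex subsets of $\mathcal{D}^{-\delta}$ up to boundary points. (It is generally not true that a maximal convex subset of an open set is open, hence this qualification.)

To prove this claim, let $E$ denote an arbitrary maximal convex subset of $\mathcal{D}^{-\delta}$, and set

$$E^{+\delta} = \bigcup_{x \in E} \operatorname{int} B_\delta(x), \tag{3.16}$$

where $\operatorname{int} \mathcal{V}$ denotes the interior of a set $\mathcal{V}$. Let $F \supset E^{+\delta}$ denote a maximal convex extension of $E^{+\delta}$ in $\mathcal{D}$. Clearly, $\operatorname{int} E \subset F^{-\delta} \subset \mathcal{D}^{-\delta}$.

Vice versa, suppose $x \in F^{-\delta} \setminus \operatorname{cl} E$. Since the convex hull of $\{x\} \cup E$ is contained in $\mathcal{D}^{-\delta}$, this contradicts the maximality of $E$ up to boundary points. We conclude that $\operatorname{int} E = F^{-\delta}$. That is, $\{C_\alpha^{-\delta} : \alpha \in A\}$ is the collection of all convex subsets of $\mathcal{D}^{-\delta}$ which are maximal up to boundary points, and

$$\bigcap_{\alpha \in A} C_\alpha^{-\delta} \subset \operatorname{ck}(\mathcal{D}^{-\delta}) \subset \bigcap_{\alpha \in A} \operatorname{cl} C_\alpha^{-\delta}.$$

Since, by assumption, $(\operatorname{ck}\mathcal{D})^{-\delta}$ is nonempty, there exists an $x \in \operatorname{ck}\mathcal{D}$ satisfying $\operatorname{dist}(x, \partial \operatorname{ck}\mathcal{D}) > \delta$. Clearly, $x \in C_\alpha$ with $\operatorname{dist}(x, \partial C_\alpha) > \delta$ for every $\alpha \in A$. Thus, $x \in C_\alpha^{-\delta}$ and we conclude that $\operatorname{ck}(\mathcal{D}^{-\delta})$ is also nonempty. This implies that $\mathcal{D}^{-\delta}$ is star-shaped. □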
For bounds on the modified Hamiltonian in Section 5, we will need (H3) for $\delta \in (0, \delta_*]$ on at least two scale rungs, so that, for simplicity, we assume the following.

(H4) Each $\mathcal{D}_k$ for $k = 0, \dots, K$ is bounded and star-shaped with $(\operatorname{ck}\mathcal{D}_k)^{-\delta_*}$ nonempty.

The following are examples of hierarchies of sets which satisfy (H4) in general Banach spaces.

Example 3.7. The domains $\mathcal{D}_k$ are open $\mathcal{Y}_k$ balls $\operatorname{int}(B^{\mathcal{Y}_k}_{R_k}(0))$ with radii $R_K < \dots < R_0$.

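Example 3.8. Let $\mathcal{D} = \mathcal{D}_0$ be a bounded star-shaped domain, fix $K \in \mathbb{N}$, and let $R > 0$ sufficiently large and $\delta_* > 0$ sufficiently small such that $(\operatorname{ck}\mathcal{D})^{-\delta_* K} \neq \emptyset$, $\mathcal{D} \subset B^{\mathcal{Y}}_R(U^0)$ for some $U^0 \in (\operatorname{ck}\mathcal{D})^{-\delta_* K} \cap \mathcal{Y}_K$ and $R > K\delta_* > 0$. Then, for $k = 0, \dots, K$, the domains $\mathcal{D}_k = \mathcal{D}^{-\delta_* k} \cap \operatorname{int}(B^{\mathcal{Y}_k}_{R - k\delta_*}(U^0))$ are nested and star-shaped with $(\operatorname{ck}\mathcal{D}_k)^{-\delta_*} \neq \emptyset$. By Theorem 3.6, the sets $\mathcal{D}_k^{-\delta}$ are star-shaped for all $\delta \in [0, \delta_*]$ and, by construction,

Hence, $\mathcal{D}_0, \dots, \mathcal{D}_K$ are $\delta_*$-nested and (H4) holds.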

In the next section we introduce concrete function spaces and superposition operators on these spaces in order to verify that our main examples, the nonlinear Schrödinger equation and the semilinear wave equation, fit into our abstract framework.

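3.4. Spaces of analytic functions. We denote the Fourier coefficients of a function $u \in \mathcal{L}^2(S^1; \mathbb{C}^d)$ by $\hat{u}_k$, so that

$$u(x) = \frac{1}{\sqrt{2\pi}} \sum_{k \in \mathbb{Z}} \hat{u}_k \, e^{\mathrm{i}kx}. \tag{3.17}$$

Let $\mathcal{G}_{\tau,\ell} = \mathcal{G}_{\tau,\ell}(S^1; \mathbb{C}^d)$ denote the Hilbert space of analytic functions $u \in \mathcal{L}^2(S^1; \mathbb{C}^d)$ for which

$$\|u\|^2_{\mathcal{G}_{\tau,\ell}} = \langle u, u \rangle_{\mathcal{G}_{\tau,\ell}} < \infty,$$

where the inner product is given by

$$\langle u, v \rangle_{\mathcal{G}_{\tau,\ell}} = \langle \hat{u}_0, \hat{v}_0 \rangle_{\mathbb{C}^d} + \sum_{k \in \mathbb{Z}} |k|^{2\ell} \, e^{2\tau|k|} \, \langle \hat{u}_k, \hat{v}_k \rangle_{\mathbb{C}^d}. \tag{3.18}$$

It can be shown that $\mathcal{G}_{\tau,\ell}(S^1; \mathbb{C}^d)$ contains all real analytic functions whose radius of analyticity is at least $\tau$. In particular, functions in $\mathcal{G}_{\tau,\ell}(S^1; \mathbb{C}^d)$ can be differentiated infinitely often. This follows from Lemma 3.1 with $\mathcal{Y} = \mathcal{L}^2(S^1; \mathbb{C}^d)$ and $A = \mathrm{i}\partial_x$. We write $\mathcal{H}_\ell = \mathcal{G}_{0,\ell}$ to denote the usual Sobolev space of functions whose weak derivatives up to order $\ell$ are square-integrable.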

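The additional index $\ell$ in $\mathcal{G}_{\tau,\ell}$ is important because of the following.

Lemma 3.9 ([11, Lemma 1]). The space $\mathcal{G}_{\tau,\ell}(S^1; \mathbb{C})$ is a topological algebra for every $\tau > 0$ and $\ell > 1/2$. Specifically, there exists a constant $c = c(\ell)$ such that for every $u, v \in \mathcal{G}_{\tau,\ell}(S^1; \mathbb{C})$ the product $uv \in \mathcal{G}_{\tau,\ell}(S^1; \mathbb{C})$ with

$$\|uv\|_{\mathcal{G}_{\tau,\ell}(S^1;\mathbb{C})} \le c \, \|u\|_{\mathcal{G}_{\tau,\ell}(S^1;\mathbb{C})} \, \|v\|_{\mathcal{G}_{\tau,\ell}(S^1;\mathbb{C})}. \tag{3.19}$$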
To treat general nonlinear potentials in (3.22), we need to consider superposition operators $f : \mathcal{G}_{\tau,\ell} \to \mathcal{G}_{\tau,\ell}$ of analytic functions. The following lemma is a minor adaptation of results proved in [11, 23].

Lemma 3.10. If $f : \mathbb{C}^d \to \mathbb{C}^d$ is entire then $f$ is also entire as a function from $\mathcal{G}_{\tau,\ell}(S^1; \mathbb{C}^d)$ to itself for every $\tau > 0$ and $\ell > 1/2$. If $f$ is analytic on $B_r(u_0) \subset \mathbb{C}^d$ where $u_0 \in \mathbb{C}^d$ then $f$ is analytic and uniformly bounded from $B_R(u_0) \subset \mathcal{G}_{\tau,\ell}(S^1; \mathbb{C}^d)$ to $\mathcal{G}_{\tau,\ell}(S^1; \mathbb{C}^d)$ for any $R < r/c$, where $c = c(\ell)$ is the constant from Lemma 3.9.



Proof. Let $f$ be entire and let

$$f(z) = \sum_{n=0}^{\infty} a_n (z - u_0)^n \tag{3.20}$$

be the Taylor series of $f$ around $u_0 \in \mathbb{C}$. Let $\phi : \mathbb{R} \to \mathbb{R}$ be its majorization

$$\phi(s) = \sum_{n=0}^{\infty} |a_n| \, s^n.$$

By applying the algebra inequality (3.19) to each term of the power series expansion (3.20) of $f(u)$, we see that the series converges for every $u \in \mathcal{G}_{\tau,\ell}$ provided $\tau > 0$ and $\ell > 1/2$, and that

$$\|f(u)\|_{\mathcal{G}_{\tau,\ell}} \le c^{-1} \phi\bigl(c \, \|u - u_0\|_{\mathcal{G}_{\tau,\ell}}\bigr) + |a_0| \bigl(\sqrt{2\pi} - c^{-1}\bigr), \tag{3.21}$$

where $c$ is as in Lemma 3.9, see [11]. In other words, $f$ is entire on $\mathcal{G}_{\tau,\ell}(S^1)$.

When $f$ has only a finite radius of analyticity, we argue as follows. Assume that $|f(z)| \le M$ on $B_r(u_0)$. Then, by Cauchy's estimate,

$$|a_n| \le \frac{M}{r^n}.$$

Consequently, the majorant $\phi$ is bounded on any $B_\rho(0)$ with $\rho < r$ with uniform bound

$$\mu = \frac{M}{1 - \rho/r}.$$

Due to (3.21), the superposition operator $f$ is then analytic and bounded by

$$M_{\sup} = \mu/c + |a_0| \bigl(\sqrt{2\pi} - c^{-1}\bigr)$$

as a map from a ball of radius $R = \rho/c$ around $u_0 \in \mathcal{G}_{\tau,\ell}$ into $\mathcal{G}_{\tau,\ell}$. □
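
The Cauchy estimate and the resulting geometric bound on the majorant can be checked on a concrete scalar function. The sketch below uses $f(z) = 1/(2 - z)$ around $u_0 = 0$ with $r = 1$, for which $M = 1$ and $a_n = 2^{-(n+1)}$. This function and the helper names are assumptions chosen for illustration only.

```python
def taylor_coeff(n):
    # f(z) = 1 / (2 - z) = sum_n 2^{-(n+1)} z^n  (assumed example function).
    return 2.0 ** (-(n + 1))

R_ANALYTIC = 1.0   # radius r of the ball on which f is bounded
M_BOUND = 1.0      # sup of |1 / (2 - z)| over |z| <= 1, attained at z = 1

def majorant(rho, terms=200):
    # Truncated majorant series phi(rho) = sum_n |a_n| rho^n.
    return sum(abs(taylor_coeff(n)) * rho ** n for n in range(terms))

def geometric_bound(rho):
    # Bound M / (1 - rho / r) obtained from Cauchy's estimate |a_n| <= M / r^n.
    return M_BOUND / (1.0 - rho / R_ANALYTIC)

for rho in (0.1, 0.5, 0.9):
    print(rho, majorant(rho), geometric_bound(rho))
```

For this example the majorant equals $1/(2 - \rho)$, which stays well below the geometric bound; the bound is not sharp, but it is uniform on every ball of radius $\rho < r$.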

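3.5. Functional setting for the semilinear wave equation. Consider the semilinear wave equation

$$\partial_{tt} u = \partial_{xx} u - V'(u) \tag{3.22}$$

on the circle $S^1 = \mathbb{R}/2\pi\mathbb{Z}$. Its Hamiltonian can be written

$$H(u,v) = \int_{S^1} \Bigl[ \tfrac12 v^2 + \tfrac12 (\partial_x u)^2 + V(u) \Bigr] \, dx,$$

where $v = \partial_t u$. We write $U = (u,v)^T$ and set $\mathcal{Y} = \mathcal{H}_1(S^1;\mathbb{R}) \times \mathcal{L}^2(S^1;\mathbb{R})$ so that the Hamiltonian is well-defined on $\mathcal{Y}$. For $U = (u,v) \in \mathcal{Y}$ let $\mathbb{P}_0 U = (p_0 u, p_0 v)$ where for $u \in \mathcal{L}^2(S^1;\mathbb{R})$ we define $p_0 u = \hat{u}_0$. Setting

$$\tilde{A} = \begin{pmatrix} 0 & \mathrm{id} \\ \partial_x^2 & 0 \end{pmatrix},$$

we then define

$$A = \mathbb{Q}_0 \tilde{A}, \qquad B(U) = \begin{pmatrix} 0 \\ -V'(u) \end{pmatrix} + \mathbb{P}_0 \tilde{A} U, \tag{3.23}$$

and the symplectic structure matrix $\mathbb{J}$ via

$$\langle \mathbb{J}^{-1} U_1, U_2 \rangle_{\mathcal{Y}} = \int_{S^1} (u_1 v_2 - u_2 v_1) \, dx \tag{3.24}$$

for all $U_1 = (u_1, v_1)^T$, $U_2 = (u_2, v_2)^T \in \mathcal{Y}$.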

Since the Laplacian is diagonal in the Fourier representation (3.17) with eigenvalues $-k^2$ for $k \in \mathbb{Z}$, the eigenvalue problem for $A$ separates into $2 \times 2$ eigenvalue problems on each Fourier mode, and $\operatorname{spec} A = \mathrm{i}\mathbb{Z} \setminus \{0\}$. Clearly, $A$ is skew-symmetric on $\mathcal{Y}$ with respect to the inner product (3.18). Note that $\mathbb{P}_0 \tilde{A}$ has a Jordan block and is hence included with the nonlinearity $B$. Thus,

with $q = 1$.
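
The mode-by-mode structure can be made concrete with a small computation. The sketch below assumes the first-order form with the $2 \times 2$ block $\begin{pmatrix} 0 & 1 \\ -k^2 & 0 \end{pmatrix}$ on the $k$-th Fourier mode (the helper names are hypothetical). It computes the eigenvalues $\pm \mathrm{i}|k|$ from trace and determinant and checks that the $k = 0$ block is a nonzero nilpotent matrix, i.e., a Jordan block.

```python
import cmath

def mode_block(k):
    # 2x2 block of the first-order wave operator on the k-th Fourier mode.
    return [[0.0, 1.0], [-float(k * k), 0.0]]

def eigenvalues_2x2(m):
    # Roots of lambda^2 - tr * lambda + det = 0.
    tr = m[0][0] + m[1][1]
    det = m[0][0] * m[1][1] - m[0][1] * m[1][0]
    disc = cmath.sqrt(tr * tr - 4.0 * det)
    return ((tr + disc) / 2.0, (tr - disc) / 2.0)

def matmul_2x2(a, b):
    return [[sum(a[i][l] * b[l][j] for l in range(2)) for j in range(2)]
            for i in range(2)]

for k in (1, 2, 3):
    print(k, eigenvalues_2x2(mode_block(k)))      # purely imaginary pair +/- i k

zero_block = mode_block(0)
print(matmul_2x2(zero_block, zero_block))          # zero matrix: Jordan block at k = 0
```

For $k \neq 0$ each block has two distinct purely imaginary eigenvalues, while for $k = 0$ the block is not diagonalizable, which is consistent with moving the zero mode into $B$.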

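The symplectic structure operator $\mathbb{J}$ defined by (3.24) is an unbounded operator on $\mathcal{Y}_{\tau,\ell}$ with domain $\mathcal{Y}_{\tau,\ell+1}$. It is possible, though not necessary for anything which follows, to compute $\mathbb{J}^{-1}$ explicitly. Namely, (3.24) reads

$$\langle \mathbb{J}^{-1} U_1, U_2 \rangle_{\mathcal{Y}} = \langle (\mathbb{J}^{-1} U_1)_u, u_2 \rangle_{\mathcal{H}_1} + \langle (\mathbb{J}^{-1} U_1)_v, v_2 \rangle_{\mathcal{L}^2} = \int_{S^1} (u_1 v_2 - u_2 v_1) \, dx. \tag{3.25}$$

The definition of the inner product (3.18) implies

$$\langle (\mathbb{J}^{-1} U_1)_u, u_2 \rangle_{\mathcal{H}_1} = \langle (p_0 - \partial_x^2)(\mathbb{J}^{-1} U_1)_u, u_2 \rangle_{\mathcal{L}^2},$$

so that (3.25) splits into

$$(p_0 - \partial_x^2)(\mathbb{J}^{-1} U_1)_u = -v_1 \quad\text{and}\quad (\mathbb{J}^{-1} U_1)_v = u_1.$$

We conclude that

$$\mathbb{J}^{-1} = \begin{pmatrix} 0 & -(p_0 - \partial_x^2)^{-1} \\ 1 & 0 \end{pmatrix}.$$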

If the potential $V : D \to \mathbb{R}$ is analytic on an open set $D \subset \mathbb{R}$, then, by Lemma 3.10, $B$ is analytic from $B^{\mathcal{Y}_{\tau,\ell}}_R(U^0)$ to $\mathcal{Y}_{\tau,\ell}$ for any $\tau, \ell > 0$ and $U^0 = (u^0, v^0)$ provided $B_r(u^0) \subset D$, $R < r/c(\ell)$ as in Lemma 3.10, and $v^0 \in \mathcal{L}^2$ arbitrary. In this setting, all the above assumptions are satisfied.

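3.6. Functional setting for the nonlinear Schrödinger equation. Consider the nonlinear Schrödinger equation

$$\mathrm{i} \, \partial_t u = -\partial_{xx} u + \partial_{\bar{u}} V(u, \bar{u}) \tag{3.26}$$

on the circle $S^1$, where $V(u, \bar{u})$ is analytic in $\operatorname{Re} u$ and $\operatorname{Im} u$. Setting $U = u$, we can write

$$A = \mathrm{i} \, \partial_x^2, \qquad B(U) = -\mathrm{i} \, \partial_{\bar{u}} V(u, \bar{u}), \tag{3.27}$$

and, similarly to (3.24),

$$\langle \mathbb{J}^{-1} U_1, U_2 \rangle_{\mathcal{Y}} = \int_{S^1} \operatorname{Re}(\mathrm{i} \, u_1 \bar{u}_2) \, dx \tag{3.28}$$

for all $u_1, u_2 \in \mathcal{Y} = \mathcal{H}_1(S^1; \mathbb{C})$. The Hamiltonian is given by

$$H(U) = \frac12 \int_{S^1} \bigl( |\partial_x u|^2 + V(u, \bar{u}) \bigr) \, dx. \tag{3.29}$$

The Laplacian is diagonal in the Fourier representation (3.17) with eigenvalues $-k^2$. Hence, $\operatorname{spec} A = \{-\mathrm{i}k^2 : k \in \mathbb{Z}\}$ and $A$ generates a unitary group on $\mathcal{L}^2(S^1; \mathbb{C})$ and, more generally, on every $\mathcal{G}_{\tau,\ell}$ with $\ell \in \mathbb{N}_0$ and $\tau > 0$. Further, $\mathcal{Y}_{\tau,\ell} = \mathcal{G}_{\tau,2\ell+1}(S^1; \mathbb{C})$ with $q = 2$.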

If the potential $V : D \subset \mathbb{R}^2 \to \mathbb{R}$ is analytic as a function of $(q, p) = (\operatorname{Re} u, \operatorname{Im} u)$, then, by Lemma 3.10, the nonlinearity $B(U)$ defined in (3.27) is analytic as map from a ball in $\mathcal{G}_{\tau,\ell}(S^1; \mathbb{C})$ to itself for every $\tau > 0$ and $\ell > 1/2$. The construction of the domain hierarchy works as in Section 3.5, so that all of the above assumptions can be satisfied.

We remark that if we were to write out the nonlinear Schrödinger equation in real coordinates with $U = (\operatorname{Re} u, \operatorname{Im} u)$, the structure operator $\mathbb{J}$ would be the canonical symplectic matrix on $\mathbb{R}^2$.

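3.7. Nonlocal Schrödinger equation on a star-shaped domain. Consider the nonlocal nonlinear Schrödinger equation

$$\mathrm{i} \, \partial_t u = -\partial_{xx} u + V'\Bigl( \tfrac12 \int_{S^1} |u|^2 \, dx \Bigr) u \tag{3.30}$$

with periodic boundary conditions. It is Hamiltonian with symplectic structure as above and energy

$$H(U) = \frac12 \int_{S^1} |\partial_x u|^2 \, dx + V\Bigl( \tfrac12 \int_{S^1} |u|^2 \, dx \Bigr).$$

In this case,

$$B(U) = -\mathrm{i} \, V'\Bigl( \tfrac12 \int_0^{2\pi} |u|^2 \, dx \Bigr) u$$

and $A$ is as above. Let, for example, $V(r) = 1/r$. Then $B$ is analytic as map from $\mathcal{Y}_{\tau,\ell} \setminus \{0\}$ to $\mathcal{Y}_{\tau,\ell}$ where $\mathcal{Y}_{\tau,\ell} = \mathcal{G}_{\tau,2\ell+1}$ as before. Let $\{e_k\}_{k \in \mathbb{Z}}$ denote a $\mathcal{Y}$-orthonormal basis of eigenvectors of $A$. Fix $R > 0$ and define

$$\mathcal{D} = \operatorname{int}(B^{\mathcal{Y}}_R(0)) \setminus C_- \quad\text{where}\quad C_- = \Bigl\{ u = \sum_{j \in \mathbb{Z}} u_j e_j : u_0 \le 0, \ \|u - u_0 e_0\|_{\mathcal{Y}} \le |u_0| \Bigr\}.$$

Then $\mathcal{D}$ is star-shaped with respect to any $u$ in the cone

$$C_+ = \Bigl\{ u = \sum_{j \in \mathbb{Z}} u_j e_j : u_0 > 0, \ \|u - u_0 e_0\|_{\mathcal{Y}} < |u_0| \Bigr\} \cap B^{\mathcal{Y}}_R(0).$$

Moreover, for some $\delta_* > 0$ depending on $R$, $C_+^{-\delta}$ is nonempty for all $\delta \in (0, \delta_*)$.

Fixing a suitable $\delta_* > 0$, $K \in \mathbb{N}$, and defining the sets $\mathcal{D}_0, \dots, \mathcal{D}_K$ as in Example 3.8, we obtain a $\delta_*$-nested domain hierarchy which satisfies (B1) and (H4). If we define $\mathcal{D}^+ = \operatorname{int}(B^{\mathcal{Y}}_R(0)) \setminus C_+$ then we can define $\delta_*$-nested domains $\mathcal{D}_0^+, \dots, \mathcal{D}_K^+$ satisfying (B1) and (H4) as above such that $\mathcal{D} \cup \mathcal{D}^+$ is the punctured open disk $\operatorname{int}(B^{\mathcal{Y}}_R(0)) \setminus \{0\}$.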



4. A-STABLE RUNGE-KUTTA METHODS ON HILBERT SPACES

In this section, we introduce a class of A-stable Runge-Kutta methods which are well-defined when applied to the semilinear PDE (1.1) under assumptions (A) and (B0), and review some regularity and convergence results for those methods from [27]. In this section, we need not assume that (1.1) is Hamiltonian.
Applying an $s$-stage Runge-Kutta method of the form (2.2) to the semilinear evolution equation (1.1), we obtain

$$W = \mathbf{1} U^0 + h \, a \, (AW + B(W)), \tag{4.1a}$$
$$U^1 = U^0 + h \, b^T (AW + B(W)). \tag{4.1b}$$

For $U \in \mathcal{Y}$, we write

$$\mathbf{1} U = \begin{pmatrix} U \\ \vdots \\ U \end{pmatrix} \in \mathcal{Y}^s, \qquad
W = \begin{pmatrix} W^1 \\ \vdots \\ W^s \end{pmatrix}, \qquad
B(W) = \begin{pmatrix} B(W^1) \\ \vdots \\ B(W^s) \end{pmatrix},$$

where $W^1, \dots, W^s$ are the stages of the Runge-Kutta method,

$$(aW)^i = \sum_{j=1}^s a_{ij} W^j, \qquad b^T W = \sum_{j=1}^s b_j W^j,$$

and $A$ acts diagonally on the stages, i.e., $(AW)^i = AW^i$ for $i = 1, \dots, s$.

A more suitable form, required later, is achieved by rewriting (4.1a) as

$$W = \Pi(W; U, h) = (\mathrm{id} - haA)^{-1} (\mathbf{1} U + ha \, B(W)) \tag{4.2}$$

and

$$\Psi^h(U) = S(hA) U + h \, b^T (\mathrm{id} - haA)^{-1} B(W(U, h)), \tag{4.3}$$

where $S$ is the so-called stability function

$$S(z) = 1 + z \, b^T (\mathrm{id} - za)^{-1} \mathbf{1}. \tag{4.4}$$
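
To make the stability function concrete, the following sketch evaluates $S(z) = 1 + z\,b^T(\mathrm{id} - za)^{-1}\mathbf{1}$ for two Gauss-Legendre tableaux: the implicit midpoint rule ($s = 1$) and the two-stage method ($s = 2$). The function names are illustrative; the Butcher coefficients are the standard ones for these methods. On the imaginary axis both functions should have modulus one.

```python
import math

def stability_midpoint(z):
    # Implicit midpoint rule: a = 1/2, b = 1, so S(z) = (1 + z/2) / (1 - z/2).
    a, b = 0.5, 1.0
    return 1 + z * b / (1 - z * a)

def stability_gauss2(z):
    # Two-stage Gauss-Legendre method (order 4).
    s3 = math.sqrt(3.0)
    a = [[0.25, 0.25 - s3 / 6], [0.25 + s3 / 6, 0.25]]
    b = [0.5, 0.5]
    # Solve (id - z a) x = (1, 1)^T by Cramer's rule.
    m11, m12 = 1 - z * a[0][0], -z * a[0][1]
    m21, m22 = -z * a[1][0], 1 - z * a[1][1]
    det = m11 * m22 - m12 * m21
    x1 = (m22 - m12) / det
    x2 = (m11 - m21) / det
    return 1 + z * (b[0] * x1 + b[1] * x2)

for y in (0.1, 1.0, 25.0):
    z = 1j * y
    print(y, abs(stability_midpoint(z)), abs(stability_gauss2(z)))
```

The two-stage function coincides with the diagonal Padé approximant $(1 + z/2 + z^2/12)/(1 - z/2 + z^2/12)$ of the exponential, which gives an independent check of the linear solve.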

We now make a number of assumptions on the method and its interaction with the linear operator $A$. First, we assume that the method is A-stable in the sense of [20]. Setting $\mathbb{C}^- = \{z \in \mathbb{C} : \operatorname{Re} z \le 0\}$, the conditions are as follows.

(RK1) The stability function (4.4) is bounded with $|S(z)| \le 1$ for all $z \in \mathbb{C}^-$.
(RK2) The $s \times s$ matrices $\mathrm{id} - za$ are invertible for all $z \in \mathbb{C}^-$.

We also require two further conditions.

(RK3) The matrix $a$ is invertible.
(RK4) The method is symplectic, i.e., satisfies (2.10).

Gauss-Legendre Runge-Kutta methods satisfy conditions (RK1-4); see [27, Lemma 3.6] for conditions (RK1-3) and [15] for (RK4).

In the following, we also need to refer to a set of key estimates on the linear operators which appear in the formulation (4.2) and (4.3) of the Runge-Kutta method, namely

$$\|(\mathrm{id} - haA)^{-1}\|_{\mathcal{Y}^s \to \mathcal{Y}^s} \le \Lambda, \tag{4.5a}$$
$$\|haA \, (\mathrm{id} - haA)^{-1}\|_{\mathcal{Y}^s \to \mathcal{Y}^s} \le 1 + \Lambda, \tag{4.5b}$$
$$\|S(hA)\|_{\mathcal{Y} \to \mathcal{Y}} \le 1 + \sigma h \le c_S, \tag{4.5c}$$

for all $h \in [0, h_*]$ and constants $\Lambda \ge 1$, $\sigma \ge 0$, and $c_S \ge 1$. These estimates naturally hold true on each rung of our hierarchy of spaces. Their proofs are given in [27, Section 3.2].

A-stable Runge-Kutta methods have the remarkable property that their time-$h$ map is of the same regularity class as the flow of the evolution equation stated in Theorem 3.2. We state this result as an abbreviated version of [27, Theorem 3.15 and Remark 3.17].

Theorem 4.1 (Regularity of numerical method). Assume (A), (B1), and (RK1-3). Pick $\delta \in (0, \delta_*]$ such that $\mathcal{D}_{K+1}^{-\delta}$ is nonempty. Then there exists $h_* > 0$ such that the components $W^j$ of the stage vector $W(U, h)$ and the numerical method $\Psi$ are of class (3.11) with $T_*$ there replaced by $h_*$ here. Moreover, $\Psi$ and $W^j$ map into $\mathcal{D}_K$. The bounds on $W$, $\Psi$, and $h_*$ only depend on the bounds afforded by (B1) and (3.10), on the coefficients of the method, on the constants afforded by (4.5), and on $\delta$.

Remark 4.2. Even when $B : \mathcal{Y} \to \mathcal{Y}$ is analytic, the numerical time-$h$ map $\Psi^h(U)$ is not analytic in $h$ unless $U$ and the image of $B$ are restricted to a finite dimensional subspace. Take, for example, the linear Schrödinger equation, i.e., equation (3.26) with $B = 0$, discretized by the implicit midpoint rule. Then

$$h \mapsto S(hA) \, e_k = (\mathrm{id} + \tfrac12 hA)(\mathrm{id} - \tfrac12 hA)^{-1} e_k = \frac{1 - \tfrac12 \mathrm{i} h k^2}{1 + \tfrac12 \mathrm{i} h k^2} \, e_k$$

has radius of analyticity $|h| < 2/k^2$, where $e_k$ is the $k$-th Galerkin mode of $A$ as described in Section 3.6. Therefore, if the Fourier expansion of $U$ does not terminate finitely, then $\Psi^h(U) = S(hA)U$ cannot be analytic in $h$. This argument applies to any A-stable Runge-Kutta method: Since $|S(z)| \le 1$ for all $z \in \mathrm{i}\mathbb{R}$ by assumption (RK1), the stability function is a rational function $S(z) = P(z)/Q(z)$ with $\deg Q \ge \deg P$. Hence, $\deg Q \ge 1$ so that $S(z)$ has at least one pole $z_0$. The radius of analyticity of $S(z)$ around $0$ is $r_0 = |z_0|$, so that $h \mapsto S(hA) e_k$ cannot be analytic outside a ball around $h = 0$ of radius $|h| < r_0/k^2$. As for the implicit midpoint rule, this implies that $\Psi^h(U)$ is not analytic in $h$ unless the Fourier expansion of $U$ is finite.
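
The shrinking radius of analyticity in Remark 4.2 can be observed numerically. The sketch below assumes the Schrödinger eigenvalue $\lambda_k = -\mathrm{i}k^2$ and the implicit midpoint stability function, expands $h \mapsto S(h\lambda_k)$ in powers of $h$, and estimates the radius of convergence by the ratio test. Function names are illustrative.

```python
def midpoint_coeffs_in_h(k, n_terms):
    # With c = lam / 2 = -i k^2 / 2:
    #   S(h lam) = (1 + c h) / (1 - c h) = 1 + 2 * sum_{n >= 1} (c h)^n.
    c = -0.5j * k * k
    return [1.0 + 0j] + [2 * c ** n for n in range(1, n_terms)]

def radius_estimate(k, n=20):
    # Ratio test |c_n| / |c_{n+1}|; for this series it equals 2 / k^2.
    coeffs = midpoint_coeffs_in_h(k, n + 2)
    return abs(coeffs[n]) / abs(coeffs[n + 1])

def midpoint_closed_form(k, h):
    lam = -1j * k * k
    return (1 + 0.5 * h * lam) / (1 - 0.5 * h * lam)

for k in (1, 3, 10):
    print(k, radius_estimate(k), 2.0 / (k * k))
```

Since the radius decays like $k^{-2}$, no common disc of analyticity in $h$ exists once infinitely many modes are excited, which is the obstruction described above.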

Thus, while differentiability in $h$ can be obtained by stepping down on a scale of Hilbert spaces, analyticity can only be obtained by projecting onto a subspace on which the vector field is bounded. This will become necessary in Section 5.3, where analyticity is essential for obtaining exponential error estimates.

In [27, Theorem 3.20], we proved convergence of the time semidiscretization. Condition (A2) of [37] is always satisfied in the setting of this paper and stated as (4.5c) above, so that the convergence theorem, which will be needed later on, can be stated as follows.

Theorem 4.3 (Convergence of time semidiscretization). Assume that (1.1) satisfies (A) and apply a Runge-Kutta method of classical order $p$ subject to conditions (RK1-3) to it. Assume further that (B1) holds with $K \ge p$. Pick $\delta \in (0, \delta_*]$ such that $\mathcal{D}_{K+1}^{-\delta}$ is non-empty and fix $T > 0$. Then there exist positive constants $h_*$, $c_1$, and $c_2$ which only depend on the bounds afforded by (B1) and (3.10), on the coefficients of the method, on the constants from (4.5), and on $\delta$, such that for every $U^0$ with

$$\{\Phi^t(U^0) : t \in [0, T]\} \subset \mathcal{D}_{K+1}^{-\delta} \tag{4.6}$$

and for every $h \in [0, h_*]$, the numerical solution $(\Psi^h)^n(U^0)$ lies in $\mathcal{D}$ and satisfies

$$\|(\Psi^h)^n(U^0) - \Phi^{nh}(U^0)\|_{\mathcal{Y}} \le c_2 \, e^{c_1 nh} \, h^p$$

so long as $nh \le T$.
5. Backward error analysis for time-semidiscretizations

5.1. Statement of the main result. We are now ready to state and prove our main result on the backward error analysis of semilinear Hamiltonian evolution equations. As before, we write $j = \lfloor r \rfloor$ to denote the largest integer $j \le r$, and $j = \lceil r \rceil$ to denote the smallest integer $j \ge r$.

Theorem 5.1 (Main theorem). Assume that the semilinear Hamiltonian evolution equation (3.12) with energy (3.14) satisfies (A), (B0-2), and (H0-4). Apply a symplectic Runge-Kutta method of order $p \ge 1$ and stepsize $h$ which satisfies (RK1-4) to (3.12). Assume further that $K \ge a + b$,

$$a = \lceil p(q+1) + q \rceil \quad\text{and}\quad b = \lceil p(q+1)/q \rceil \tag{5.1}$$

with $q$ defined in (B2). Then for each sufficiently small $\delta \in (0, \delta_*]$ there exists $h_* > 0$ and a modified energy $\tilde{H} : \mathcal{D}_1^{-\delta/2} \times [0, h_*] \to \mathbb{R}$ which is analytic in $U$ for each $h \in [0, h_*]$ and satisfies

$$\sup_{U \in \mathcal{D}_{a+b}^{-\delta}} |H(U) - \tilde{H}(U, h)| = O(h^p). \tag{5.2}$$

Moreover, the modified energy is approximately conserved with exponentially small error in the sense that there is $c_* > 0$ such that

$$\sup_{U \in \mathcal{D}_{\tau,L+1}^{-\delta}} |\tilde{H}(\Psi^h(U), h) - \tilde{H}(U, h)| = O\bigl(e^{-c_* h^{-1/(1+q)}}\bigr). \tag{5.3}$$

For the semilinear wave equation, $q = 1$ (see Section 3.5), so that the exponent in (5.3) scales like $h^{-1/2}$. In the case of nonlinear Schrödinger equations, $q = 2$ (see Section 3.6), so that the exponent in (5.3) scales like $h^{-1/3}$.
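
For orientation, the rung count and the exponent in the main theorem are easy to tabulate. The helper below is a hypothetical illustration; it assumes the reading $a = \lceil p(q+1) + q \rceil$ and $b = \lceil p(q+1)/q \rceil$ of (5.1) and the exponent $h^{-1/(1+q)}$ in (5.3).

```python
import math

def rungs_and_rate(p, q):
    # a and b from (5.1); K must be at least a + b.
    a = math.ceil(p * (q + 1) + q)
    b = math.ceil(p * (q + 1) / q)
    # The exponent in (5.3) scales like h^{-rate}.
    rate = 1.0 / (1 + q)
    return a, b, a + b, rate

# Implicit midpoint rule (p = 2): wave equation (q = 1) and Schrodinger (q = 2).
print(rungs_and_rate(2, 1))
print(rungs_and_rate(2, 2))
```

Under these assumptions the implicit midpoint rule needs $K \ge 9$ rungs for the wave equation and $K \ge 11$ for the nonlinear Schrödinger equation, while higher-order Gauss-Legendre methods need correspondingly more.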

The proof of this theorem will be developed as a series of lemmas in Sections 5.2-5.4 below, the two claims corresponding to Lemma 5.17 and Lemma 5.10, respectively. Note that, due to (B2), $\mathcal{D}_{\tau,L+1}^{-\delta} \subset \mathcal{D}_{K+1}^{-\delta} \subset \mathcal{D}_{a+b}^{-\delta}$, so that the supremum in (5.2) can be taken, in particular, over $\mathcal{D}_{\tau,L+1}^{-\delta}$.

Remark 5.2. If the entire discrete orbit $\{U^j = (\Psi^h)^j(U^0) : j \in \mathbb{N}\}$ lies in $\mathcal{D}_{\tau,L+1}^{-\delta}$, then a bound of the form (5.3) holds, in particular, for all $U = U^j$, so that the energy remains approximately conserved over exponentially long times. This condition is, for example, satisfied when the discrete orbit is contained in a finite-dimensional invariant set for $\Psi^h$ which could be obtained by discretizing a periodic orbit or another finite-dimensional invariant set of the semilinear evolution equation (1.1). Kuksin and Pöschel [30] and Pöschel [31], for example, have used KAM methods to prove existence of finite-dimensional invariant tori for the nonlinear Schrödinger equation and the semilinear wave equation, respectively. It is conceivable that such methods could be extended to discrete symplectic maps $\Psi^h$.

Remark 5.3. It has been shown for certain types of nonlinear Schrödinger equations that solutions with small enough initial data remain analytic over exponentially long times under generic non-resonance conditions [10]. It is conceivable that such results also hold for A-stable Runge-Kutta time semidiscretizations of nonlinear Schrödinger equations. In [12] such results are shown for splitting methods and in [4] for trigonometric integrators applied to the nonlinear Schrödinger equation, but only over polynomially long times.
Remark 5.4. When the evolution equation (1.1) is linear, e.g., a linear wave equation or linear Schrödinger equation, its Hamiltonian is conserved exactly since symplectic Runge-Kutta methods conserve quadratic invariants [15].

When an orbit of the full semilinear evolution equation lies in $\mathcal{D}_{\tau,L+1}^{-\delta}$ over a finite interval of time, consistency of the numerical method implies that the discrete orbit will lie in a slightly larger subset of a Hilbert space of less regularity provided $h$ is sufficiently small. Then the conclusions of Theorem 5.1 hold true at least on this finite interval of time. To make this argument rigorous, we need to invoke our convergence result, Theorem 4.3, with $\mathcal{Y}_{\tau,L+1}$ at the base of the scale, which requires a further assumption.

(B3) There exist r > 0, q > 0, L > 0, and a sequence of <5*-nestcd y T ,L+k- 
bounded and open sets T> Ti L + k such that B 6 C^~ k (T> T ^L+ki OA-.Z+fc) for 
fc = 0,1,2. 

Further, let $\mathcal{D}_{\tau,L+3}$ denote any $\delta_*$-nested subset of $\mathcal{D}_{\tau,L+2}$ which is bounded and open in $\mathcal{Y}_{\tau,L+3}$.
Corollary 5.5. Under the assumptions of Theorem 5.1, suppose, in addition, that (B3) holds true. Fix $T > 0$, $\delta > 0$, and $\varepsilon > 0$ such that $\varepsilon + \delta \le \delta_*$. Then there exists $h_* > 0$ and a modified Hamiltonian $\tilde{H} : \mathcal{D}_{\tau,L+1}^{-\delta} \times [0, h_*] \to \mathbb{R}$ such that for any $U^0$ with

{^(U°):te[0,T}}dV-{- + % (5.4) 

and for any $h \in [0, h_*]$,

$$|\tilde{H}(U^0, h) - H(U^0)| = O(h^p),$$

the discrete trajectory satisfies $U^j = (\Psi^h)^j(U^0) \in \mathcal{D}_{\tau,L+1}^{-\delta}$, and for fixed $\beta \in (0, c_*)$,

$$|\tilde{H}(U^j, h) - \tilde{H}(U^0, h)| = O\bigl(e^{-\beta h^{-1/(1+q)}}\bigr)$$

for $j = 0, \dots, \lfloor T/h \rfloor$. The order constants in the two estimates are uniform over all $U^0$ satisfying (5.4).

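Proof. We first apply Theorem 4.3 with $p = 1$, $\mathcal{Y}_{\tau,L+1}$ in place of $\mathcal{Y}$, and with $\delta + \varepsilon$ in place of $\delta$. We conclude, in particular, that there is $h_* > 0$ such that for any $h \in (0, h_*]$, we have $U^j \in \mathcal{D}_{\tau,L+1}^{-\delta}$ so long as $0 \le j \le \lfloor T/h \rfloor$.

Next, Theorem 5.1 asserts that there is a (possibly decreased) $h_* > 0$ and a modified Hamiltonian $\tilde{H} : \mathcal{D}_{\tau,L+1}^{-\delta} \times [0, h_*] \to \mathbb{R}$ such that (5.3) holds true, and we estimate

$$|\tilde{H}(U^j) - \tilde{H}(U^0)| \le \sum_{i=0}^{j-1} |\tilde{H}(U^{i+1}) - \tilde{H}(U^i)| \le h^{-1} \, O\bigl(e^{-c_* h^{-1/(1+q)}}\bigr) = O\bigl(e^{-\beta h^{-1/(1+q)}}\bigr)$$

for any $\beta \in (0, c_*)$. □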

The remainder of the paper is devoted to the proof of Theorem 5.1. It is based on an embedding result, Lemma 5.8, which generalizes Theorem 2.1 to the Hilbert space setting. However, it is not possible to directly apply the theorems of Section 2 because the formal expansions of both the numerical method and the modified vectorfield contain powers of the unbounded operator $A$. Therefore, the modified vectorfield cannot be written as a semilinear Hamiltonian evolution equation of the form (3.12). If we were simply interested in constructing the modified vectorfield, we could get around this issue by setting up different spaces for domain and range such that the modified vectorfield, computed up to a given order, is continuous. In fact, we need such techniques to obtain the optimal order estimates for the modified Hamiltonian which is yet to be constructed, see Section 5.4. In such a setting, however, we do not have a theory of local existence of solutions for the modified differential equation so that we cannot obtain a flow.

We thus resort to the following construction. In Section 5.2, we truncate the evolution equation (3.12) to the subspace $\mathbb{P}_m \mathcal{Y}$. Then, in Section 5.3, we obtain a modified flow on this subspace and relax the cut-off $m$ in a controlled way to obtain an embedding result for the time-semidiscretization. Finally, in Section 5.4, we prove estimate (5.2).

5.2. Galerkin truncation. For given $m \in \mathbb{N}$, we define a truncated Hamiltonian evolution equation by restricting the Hamiltonian phase space to the finite dimensional subspace $\mathbb{P}_m \mathcal{Y}$. Since $\nabla (H|_{\mathbb{P}_m \mathcal{Y}}) = \mathbb{P}_m \nabla H$ and $\mathbb{J}^{-1}$ leaves $\mathbb{P}_m \mathcal{Y}$ invariant by Lemma 3.4, the corresponding restricted evolution equation reads

$$\dot{u}_m = \mathbb{J} \, \mathbb{P}_m \nabla H(u_m).$$

Thus, setting $f_m = \mathbb{P}_m F$ and $B_m = \mathbb{P}_m B$, we can write

$$\dot{u}_m = f_m(u_m) = A u_m + B_m(u_m). \tag{5.5}$$

We denote the flow of the projected system on $\mathbb{P}_m \mathcal{Y}$ by $\phi_m^t$. For convenience, we set $\Phi_m^t = \phi_m^t \circ \mathbb{P}_m$. Similarly, let $w_m$ denote the stage vector, $w_m^j$ for $j = 1, \dots, s$ its components, and $\psi_m^h$ denote the numerical time-$h$ map obtained by applying an $s$-stage Runge-Kutta method to the projected semilinear evolution equation (5.5), and abbreviate $W_m^j = w_m^j \circ \mathbb{P}_m$ and $\Psi_m^h = \psi_m^h \circ \mathbb{P}_m$.

In [25], we proved that all of the maps above (the truncated flow $\Phi_m^t$ and the components $W_m^j$ of the stage vector and the time-$h$ map $\Psi_m^h$ of the truncated system) are of the same class (3.11) as the exact flow $\Phi^t$ with $m$-independent bounds. The precise statement is as follows.

Theorem 5.6 (Regularity of flow and numerical method of projected system). Assume (A), (B1), and (RK1-3), and choose a $\delta \in (0, \delta_*]$ such that $\mathcal{D}_{K+1}^{-\delta} \neq \emptyset$. Then there are positive $T_*$, $h_*$, and $m_*$ such that for every $m \ge m_*$ the flow $\Phi_m$ is of class (3.11), and the components $W_m^j$ of the numerical stage vector and the numerical time-$h$ map $\Psi_m$ are of the same class, but with $T_*$ replaced by $h_*$, with bounds which are independent of $m \ge m_*$. Moreover, $\Phi_m$, $W_m^j$, and $\Psi_m$ map $\mathcal{D}_{K+1}^{-\delta}$ into $\mathcal{D}_K$.

Note that $\phi_m^t$ and $\psi_m^h$ are analytic in $t$ resp. in $h$ so long as $B$ is analytic on $\mathcal{Y}$. However, the radius of analyticity is generally not uniform in $m$.

Next, we present an exponential error bound for the projection error for the numerical scheme; this is necessary for obtaining an exponential backward error analysis in Section 5.3 below.

Lemma 5.7 (Exponential projection error estimate for the numerical scheme). Assume that the semilinear evolution equation (1.1) satisfies conditions (A) and (B0-2). Let, as before, $\Psi^h$ and $\Psi_m^h$ denote a single step of a Runge-Kutta method subject to (RK1-3) applied to the full and the projected semilinear evolution equation, (1.1) and (5.5), respectively. Then for $\delta \in (0, \delta_*]$ there are positive constants $h_*$, $m_*$, and $c_*$ such that for all $m \ge m_*$, $h \in [0, h_*]$ and $U \in \mathcal{D}_{\tau,L+1}^{-\delta}$,

$$\|\Psi^h(U) - \Psi_m^h(U)\|_{\mathcal{Y}_1} \le c_* \, m^{-L} \, e^{-\tau m^{1/q}}. \tag{5.6}$$

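Proof. By Theorems 4.1 and 5.6, applied with $\mathcal{Y}_{\tau,L}$ in place of $\mathcal{Y}$ and $K = 0$, there exist $h_* > 0$ and $m_* > 0$ such that

$$\Psi, \Psi_m, W^j, W_m^j \in \mathcal{C}_b\bigl(\mathcal{D}_{\tau,L+1}^{-\delta} \times [0, h_*]; \mathcal{Y}_{\tau,L+1} \cap \mathcal{D}_{\tau,L}\bigr)$$

for $j = 1, \dots, s$ with bounds which are uniform in $h \in (0, h_*]$ and $m \ge m_*$.

We first estimate the difference of the stage vectors $W(U) - W_m(U)$, noting that

$$W(U) = (\mathrm{id} - haA)^{-1} \bigl(\mathbf{1} U + ha \, B(W(U))\bigr),$$
$$W_m(U) = (\mathrm{id} - haA)^{-1} \bigl(\mathbb{P}_m \mathbf{1} U + h \, \mathbb{P}_m a \, B(W_m(U))\bigr).$$

Taking the difference of both expressions, we obtain

$$W(U) - W_m(U) = (\mathrm{id} - haA)^{-1} \bigl(\mathbb{Q}_m \mathbf{1} U + ha \, [B(W(U)) - \mathbb{P}_m B(W_m(U))]\bigr). \tag{5.7}$$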

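By (4.5a),

$$\|ha \, (\mathrm{id} - haA)^{-1} \bigl(B(W(U)) - \mathbb{P}_m B(W_m(U))\bigr)\|_{\mathcal{Y}^s} \le h \Lambda \|a\| \, \|B(W(U)) - \mathbb{P}_m B(W_m(U))\|_{\mathcal{Y}^s}.$$

Using the triangle inequality and the mean value theorem, we further estimate

$$\begin{aligned}
\|B(W(U)) - \mathbb{P}_m B(W_m(U))\|_{\mathcal{Y}^s}
&\le \|\mathbb{Q}_m B(W(U))\|_{\mathcal{Y}^s} + \|\mathbb{P}_m B(W(U)) - \mathbb{P}_m B(W_m(U))\|_{\mathcal{Y}^s} \\
&\le \|\mathbb{Q}_m B(W(U))\|_{\mathcal{Y}^s} + M_0' \, \|W(U) - W_m(U)\|_{\mathcal{Y}^s}.
\end{aligned} \tag{5.8}$$

Taking the $\mathcal{Y}^s$ norm of (5.7), using (4.5a), and inserting (5.8), we obtain

$$\|W(U) - W_m(U)\|_{\mathcal{Y}^s} \le \Lambda \|\mathbb{Q}_m U\|_{\mathcal{Y}} + h \|a\| \Lambda \, \|\mathbb{Q}_m B(W(U))\|_{\mathcal{Y}^s} + h M_0' \|a\| \Lambda \, \|W(U) - W_m(U)\|_{\mathcal{Y}^s}.$$

This proves that, for $h_* < 1/(M_0' \|a\| \Lambda)$,

$$\|W(U) - W_m(U)\|_{\mathcal{Y}^s} \le \frac{\Lambda \, \|\mathbb{Q}_m U\|_{\mathcal{Y}}}{1 - h_* M_0' \Lambda \|a\|} + \frac{h_* \|a\| \Lambda \, \|\mathbb{Q}_m B(W(U))\|_{\mathcal{Y}^s}}{1 - h_* M_0' \Lambda \|a\|}.$$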

We apply (3.5) to the first term on the right and note that, again by (3.5),

$$\|\mathbb{Q}_m B(W(U))\|_{\mathcal{Y}^s} \le m^{-L} \, e^{-\tau m^{1/q}} \, M_{\tau,L}, \tag{5.9}$$

where $M_{\tau,L}$ is the bound for the norm of $B : \mathcal{D}_{\tau,L} \to \mathcal{Y}_{\tau,L}$ afforded by assumption (B2). This establishes that there exists a constant $c_W$ such that

$$\|W(U) - W_m(U)\|_{\mathcal{Y}^s} \le c_W \, m^{-L} \, e^{-\tau m^{1/q}} \tag{5.10}$$

for all $m \ge m_*$ and $h \in [0, h_*]$. (As in [25], we could obtain a stage vector error bound in the $\mathcal{Y}_1^s$-norm by applying $A$ onto (5.7) and using (4.5b), but this is not necessary for what follows.)

To estimate the difference between the Runge-Kutta updates, we write them in the form (4.3) such that the respective right hand sides are (uniformly) bounded operators,

$$\Psi^h(U) = S(hA) U + h \, b^T (\mathrm{id} - haA)^{-1} B(W(U)),$$
$$\Psi_m^h(U) = S(hA) \, \mathbb{P}_m U + h \, b^T (\mathrm{id} - haA)^{-1} \mathbb{P}_m B(W_m(U)).$$

Then,

$$\Psi^h(U) - \Psi_m^h(U) = S(hA) \, \mathbb{Q}_m U + h \, b^T (\mathrm{id} - haA)^{-1} [B(W(U)) - \mathbb{P}_m B(W_m(U))].$$

Inserting (5.9) and (5.10) back into (5.8), we also find that

$$\|B(W(U)) - \mathbb{P}_m B(W_m(U))\|_{\mathcal{Y}^s} \le c_B \, m^{-L} \, e^{-\tau m^{1/q}} \tag{5.11}$$

for some constant $c_B > 0$. We note that (3.4) and (4.5b) imply

$$\|ha \, (\mathrm{id} - haA)^{-1}\|_{\mathcal{Y}^s \to \mathcal{Y}_1^s} \le 2 + \Lambda.$$

This and the invertibility of $a$, assumption (RK3), then yield

$$\begin{aligned}
\|\Psi^h(U) - \Psi_m^h(U)\|_{\mathcal{Y}_1}
&\le (1 + \sigma h) \, \|\mathbb{Q}_m U\|_{\mathcal{Y}_1} + \|b\| \, \|a^{-1}\| \, \|ha \, (\mathrm{id} - haA)^{-1} \bigl(B(W(U)) - \mathbb{P}_m B(W_m(U))\bigr)\|_{\mathcal{Y}_1^s} \\
&\le (1 + \sigma h) \, \|\mathbb{Q}_m U\|_{\mathcal{Y}_1} + (2 + \Lambda) \, \|b\| \, \|a^{-1}\| \, \|B(W(U)) - \mathbb{P}_m B(W_m(U))\|_{\mathcal{Y}^s}.
\end{aligned}$$

Inequality (5.6) is now a consequence of (3.5) and (5.11). □

5.3. Embedding of semidiscretizations into a flow. We now apply the backward error analysis of Section 2 to the sequence of truncated problems defined above where, for each $m$, we work on the finite dimensional space $\mathbb{P}_m \mathcal{Y}$ endowed with the $\mathcal{Y}_1$-norm.

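By construction, $f_m$ is analytic on $\mathcal{D} \cap \mathbb{P}_m \mathcal{Y}$ for each $m \in \mathbb{N}$ and satisfies the estimate

$$\begin{aligned}
\|f_m(u_m)\|_{\mathcal{Y}_1}
&\le \|A u_m\|_{\mathcal{Y}_1} + \|B_m(u_m)\|_{\mathcal{Y}_1} \\
&\le \sup_{U \in \mathcal{D}_1} \|A \mathbb{P}_m U\|_{\mathcal{Y}_1} + m \sup_{U \in \mathcal{D}} \|\mathbb{P}_m B(U)\|_{\mathcal{Y}} \le c_F \, m
\end{aligned} \tag{5.12}$$

for $u_m \in \mathcal{D} \cap \mathbb{P}_m \mathcal{D}_1$ with

$$c_F = \sup_{U \in \mathcal{D}_1} \|U\|_{\mathcal{Y}_1} + M_0,$$

where the supremum is finite as $\mathcal{D}_1$ is bounded.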

Setting $M = c_F m$, Theorem 2.1 asserts that the numerical time-$h$ map can be embedded into a modified flow up to an error which is exponentially small in the step size $h$, albeit not uniformly in $m$. If, however, we make the stronger assumption that the initial data lies in some Gevrey space $\mathcal{Y}_{\tau,L+1}$ with $\tau > 0$, Lemma 5.7 asserts that the numerical solution to the full semilinear evolution equation (1.1) remains exponentially close in the spectral cutoff $m$ to the numerical solution of the projected system. Thus, we can carefully choose $m = m(h)$ to balance the projection error and the embedding error bounds to obtain an embedding result on the Gevrey space which is still exponential in $h$, but at a lesser rate. This is done in the next lemma, where we also show that the result can be formulated not only on balls as in Theorem 2.1, but also on more general $m$-independent subdomains of $\mathcal{Y}_1$ as needed in the proof of Theorem 5.1.
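
The balancing argument can be illustrated with a simplified error model. The sketch below assumes, purely for illustration, that the projection error behaves like $e^{-\tau m^{1/q}}$ and the embedding error like $e^{-\chi/(mh)}$ for some constant $\chi > 0$ (the latter reflecting that the vector field bound grows linearly in $m$). Polynomial prefactors are ignored, and $\chi$ and the function names are hypothetical. Equating the two exponents gives $m(h) = (\chi/(\tau h))^{q/(1+q)}$ and a common exponent proportional to $h^{-1/(1+q)}$.

```python
def balanced_cutoff(h, tau, chi, q):
    # Solve tau * m**(1/q) == chi / (m * h) for m (treated as a real number here).
    return (chi / (tau * h)) ** (q / (1.0 + q))

def exponents(h, tau, chi, q):
    m = balanced_cutoff(h, tau, chi, q)
    projection = tau * m ** (1.0 / q)   # exponent of the projection error
    embedding = chi / (m * h)           # exponent of the embedding error
    return projection, embedding

def predicted_exponent(h, tau, chi, q):
    # Closed form of the balanced exponent: tau^{q/(1+q)} chi^{1/(1+q)} h^{-1/(1+q)}.
    return tau ** (q / (1.0 + q)) * chi ** (1.0 / (1.0 + q)) * h ** (-1.0 / (1.0 + q))

for h in (1e-1, 1e-2, 1e-3):
    print(h, exponents(h, tau=0.5, chi=2.0, q=2), predicted_exponent(h, 0.5, 2.0, 2))
```

In an actual estimate $m$ must be an integer and the constants come from the embedding theorem, so this only indicates where the rate $h^{-1/(1+q)}$ originates.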

Following the notation of Section 2, we denote the coefficients of the power series expansion of $\psi_m^h$ by $g_m^j$, the expansion coefficients of the modified vector field defined via (2.5) by $f_m^j$, and seek an optimally truncated modified vector field of the form

$$\tilde{f}_m^n(u_m; h) = f_m(u_m) + \sum_{j=p}^{n-1} h^j f_m^{j+1}(u_m). \tag{5.13}$$
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Lemma 5.8 (Embedding lemma for Gevrey class data). Assume that the semilinear 
evolution equation (jl.ljl satisfies conditions (A) and (BO-2). Let, as before, \&™ and 
ip 1 ^ denote a single step of a Runge-Kutta method subject to (RK1-3) applied to the 
full and the projected semilinear evolution equation, (jl.ip and (15.5[) . respectively. 
Then for S G (0,(5*] there exists ft* > such that the choices 



-3- i 



m(h) — (—r) 1+q and n(h) 



i 



(5.14) 



with x = S/(2erjCF) ensure that the modified vector field 

F(U;h) = 1^(^11; h), (5.15) 

where /™ is given by (15 . 13|) . is well defined as an analytic map from T> 1 S ^ 2 to y± 
for every fixed h G [0, ft*] and generates a modified flow $ : V^ 6 x [0, ft*] — > T>\ S ^ 2 . 

q 1 

Moreover, for every c* G (0, r 1 +i x q+1 ) there exists a constant such that for 
every U G D~ S L+1 and h G [0, ft*] 

\\* h (U)-* h (U)\\ yi <c i e" c *^ T * ? . (5.16) 

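Proof. Set $r = \delta/4$. As $\mathcal{D}_1$ is a bounded subset of $\mathcal{Y}_1$, there
exists $m_*$ such that, due to (3.5),
$\|\mathbb{Q}_m U\|_{\mathcal{Y}} \le m^{-1} \|U\|_{\mathcal{Y}_1} \le \delta/2 - r$ and
therefore $\mathbb{P}_m U \in \mathcal{D}^{-r}$ for every $U \in \mathcal{D}_1^{-\delta/2}$ and
$m \ge m_*$. In particular, for any such $U$ and $m$,

\[ B_r^{\mathcal{Y}_1}(\mathbb{P}_m U) = \{ u \in \mathbb{P}_m \mathcal{Y} \colon \|u - \mathbb{P}_m U\|_{\mathcal{Y}_1} \le r \} \]

is contained in $\mathbb{P}_m \mathcal{D}_1 \cap \mathcal{D}$ such that estimate (5.12) holds
true and we can apply Theorem 2.1 with $M = c_F m$ on this ball. This theorem asserts
that for every $h \in [0, h_0/4]$, where $h_0 = r/(2 e \eta c_F m)$, the modified vector
field $\tilde f_m = \tilde f_m^n$ with $n(h) = \lfloor h_0/h \rfloor$ is defined on
$B_{r/4}^{\mathcal{Y}_1}(\mathbb{P}_m U)$ and analytic as map from
$B_{r/4}^{\mathcal{Y}_1}(\mathbb{P}_m U)$ to $\mathcal{Y}_1$. Its flow $\tilde\phi_m$ satisfies
$\tilde\phi_m^t(\mathbb{P}_m U) \in B_{r/4}^{\mathcal{Y}_1}(\mathbb{P}_m U)$ for at least
$0 \le t \le h$, and

\[ \|\psi_m^h(\mathbb{P}_m U) - \tilde\phi_m^h(\mathbb{P}_m U)\|_{\mathcal{Y}_1} \le h \gamma c_F \, m \, e^{-h_0/h} . \tag{5.17} \]

By construction, $\tilde F_m = \tilde f_m \circ \mathbb{P}_m$ is analytic as map from
$\mathcal{D}_1^{-\delta/2}$ to $\mathcal{Y}_1$ and $\tilde\Phi_m = \tilde\phi_m \circ \mathbb{P}_m$
is analytic as map from $\mathcal{D}_1^{-\delta}$ to $\mathcal{D}_1^{-\delta/2}$ for any choice
of $m \ge m_*$.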

Our next step is to estimate the difference between the solution to the modified
projected equation and the numerical solution of the full semilinear evolution
equation (1.1). We split the error into the projection and the embedding error, the
first of which is controlled by Lemma 5.7 and the second is controlled by (5.17).
By Theorems 4.1 and 5.6, respectively, there are $h_* > 0$ and (a possibly increased
choice of) $m_*$ such that, for $h \in [0, h_*]$ and $m \ge m_*$, $\Psi^h$ and $\Psi_m^h$
are continuous maps from $\mathcal{D}^{-\delta}_{\tau,L+1}$ to $\mathcal{Y}_{\tau,L+1}$ and
such that the exponential projection error estimate asserted by Lemma 5.7 holds. We
might have to increase $m_*$ further to ensure that $h_0(m)/4 \le h_*$, so that the
embedding error estimate (5.17) holds true, each for every $m \ge m_*$ and
$h \in [0, h_0(m)/4]$.

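Since $\mathcal{D}^{-\delta}_{\tau,L+1} \subset \mathcal{D}_1^{-\delta}$, splitting the total
error into a projection error component and the embedding error on the finite
dimensional subspace $\mathbb{P}_m \mathcal{Y}$, we obtain that

\[
\begin{aligned}
\|\Psi^h(U) - \tilde\phi_m^h(\mathbb{P}_m U)\|_{\mathcal{Y}_1}
&\le \|\Psi^h(U) - \psi_m^h(\mathbb{P}_m U)\|_{\mathcal{Y}_1}
   + \|\psi_m^h(\mathbb{P}_m U) - \tilde\phi_m^h(\mathbb{P}_m U)\|_{\mathcal{Y}_1} \\
&\le c_\Psi \, m^{-L} e^{-\tau m^{1/q}} + h \gamma c_F \, m \, e^{-h_0/h}
\end{aligned}
\]

for all $U \in \mathcal{D}^{-\delta}_{\tau,L+1}$, $h \in [0, h_*]$, and $m \ge m_*$. The first
error decreases with $m$ whereas the second error increases with $m$. We now demand
that the two exponents on the right coincide. Under the ansatz $m = \zeta h^{-\alpha}$
for some $\zeta$ and $\alpha \in (0, 1)$, we obtain

\[
\|\Psi^h(U) - \tilde\phi_m^h(\mathbb{P}_m U)\|_{\mathcal{Y}_1}
\le h^{\alpha L} c_\Psi \, \zeta^{-L} e^{-\tau \zeta^{1/q} h^{-\alpha/q}}
  + \gamma c_F \, \zeta \, h^{1-\alpha} e^{-\chi \zeta^{-1} h^{\alpha - 1}}
\tag{5.18}
\]

with $\chi = \delta/(2 e \eta c_F)$. Then the exponents coincide provided
$\tau \zeta^{1/q} h^{-\alpha/q} = \chi \zeta^{-1} h^{\alpha - 1}$; i.e., when

\[
\alpha = \frac{q}{1+q}
\quad\text{and}\quad
\zeta = \Bigl( \frac{\chi}{\tau} \Bigr)^{\frac{q}{1+q}} .
\tag{5.19}
\]

This implies that $m(h)$ is given by (5.14), and

\[ \|\Psi^h(U) - \tilde\phi_{m(h)}^h(\mathbb{P}_m U)\|_{\mathcal{Y}_1} \le c_\Phi \, h^{\nu} e^{-c h^{-1/(1+q)}} \tag{5.20} \]

with $c = \tau \zeta^{1/q} = \chi^{\frac{1}{1+q}} \tau^{\frac{q}{1+q}}$ and
$\nu = \min\{1, qL\}/(1+q)$, and where we possibly need to shrink $h_* > 0$ to satisfy

\[ m(h_*) \ge m_* \quad\text{and}\quad h_* \le \frac{\chi \tau^q}{4^{1+q}} . \]

Since $h_0 = \chi m^{-1} = \chi^{\frac{1}{1+q}} \tau^{\frac{q}{1+q}} h^{\frac{q}{1+q}}$, the latter
inequality then guarantees that $h \le h_0/4$ and yields the form of $n(h)$ stated in
(5.14). The exponential estimate (5.16) is then obtained by defining the modified
vector field by (5.15) with corresponding modified flow
$\tilde\Phi^t(U) = \tilde\phi^t_{m(h)}(\mathbb{P}_{m(h)} U)$ and dominating the algebraic
prefactor by fractional exponential decay. □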
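
The balancing step in the proof can be checked numerically. The sketch below is not part of the original argument; it assumes the relations $m = (\chi/(\tau h))^{q/(1+q)}$ and $h_0 = \chi/m$ used above, treats $m$ as a real number instead of an integer cutoff, and uses hypothetical sample values for `tau` and `chi`. The function name `balance_exponents` is likewise only illustrative.

```python
import math


def balance_exponents(h, tau, chi, q):
    """Return (m, n, projection_exponent, embedding_exponent) for step size h.

    Assumed relations (see the lead-in): m = (chi/(tau*h))**(q/(1+q)),
    h0 = chi/m, n = floor(h0/h). Here m is a real number, not an integer.
    """
    # Spectral cutoff coupled to the step size.
    m = (chi / (tau * h)) ** (q / (1.0 + q))
    # Threshold step size of the embedding estimate and truncation index.
    h0 = chi / m
    n = math.floor(h0 / h)
    # Exponent of the projection error, exp(-tau * m**(1/q)).
    projection_exponent = tau * m ** (1.0 / q)
    # Exponent of the embedding error, exp(-h0/h).
    embedding_exponent = h0 / h
    return m, n, projection_exponent, embedding_exponent


if __name__ == "__main__":
    # Hypothetical constants; q = 1 and q = 2 are the two cases named in the abstract.
    for q in (1, 2):
        for h in (1e-1, 1e-2, 1e-3):
            m, n, pe, ee = balance_exponents(h, tau=0.5, chi=0.2, q=q)
            print(f"q={q} h={h:g} m={m:.3f} n={n} exponents=({pe:.6f}, {ee:.6f})")
```

Under these assumptions the two exponents agree up to rounding for every input, and both equal $\tau^{q/(1+q)} \chi^{1/(1+q)} h^{-1/(1+q)}$, which is the rate appearing in (5.16).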

Remark 5.9. The reason for requiring data in $\mathcal{D}^{-\delta}_{\tau,L+1}$ rather than
$\mathcal{D}^{-\delta}_{\tau,L}$ is that we want to define $\Psi^h$ and $\Psi_m^h$ uniformly
in $h$ and $m$ on general open sets, not just on open balls. This can only be done when
constructing them as maps
$\mathcal{D}^{-\delta}_{\tau,L+1} \to \mathcal{Y}_{\tau,L+1} \cap \mathcal{D}_{\tau,L}$, see [28].
Moreover, for general domains we can only maintain a valid domain of definition of the
nonlinearity under Galerkin truncation and, in particular, assert estimate (5.12) only
by dropping down at least one rung on the scale of spaces.

Next, we show that in the Hamiltonian case the above construction also yields a
modified Hamiltonian which is approximately conserved under the numerical time-$h$
map of the full semilinear evolution equation (3.12).

Lemma 5.10 (Modified Hamiltonian for Gevrey class data). Under the conditions
and in the notation of Lemma 5.8, suppose further that (H0-4) and (RK4) hold
true. Then there exists a modified Hamiltonian
$\tilde H \colon \mathcal{D}_1^{-\delta/2} \times [0, h_*] \to \mathbb{R}$, defined up to a
constant of integration, which is analytic in $U \in \mathcal{D}_1^{-\delta/2}$ for every
$h \in [0, h_*]$ and such that the modified vector field from Lemma 5.8 satisfies
$\tilde F = \mathbb{J} \nabla \tilde H$. Moreover, there exist constants $c_*, c_H > 0$ such
that for every $U \in \mathcal{D}^{-\delta}_{\tau,L+1}$ and $h \in [0, h_*]$,

\[ |\tilde H(\Psi^h(U), h) - \tilde H(U, h)| \le c_H \, e^{-c_* h^{-1/(1+q)}} . \tag{5.21} \]

Proof. Let $h \in [0, h_*]$ be fixed. On each Galerkin subspace $\mathbb{P}_m \mathcal{Y}$, the
situation is as in Section 2.3, i.e., the operator $\mathbb{J}^{-1} D\tilde f_m(u_m)$ is
self-adjoint with respect to the restriction of the $\mathcal{Y}$-inner product to
$\mathbb{P}_m \mathcal{Y}$ for each $u_m \in \mathcal{D} \cap \mathbb{P}_m \mathcal{Y}$. As we argued
in the proof of Lemma 5.8, $\mathbb{P}_m U \in \mathcal{D}$ for $U \in \mathcal{D}_1^{-\delta/2}$,
so that $\mathbb{J}^{-1} D\tilde F(U)$ is self-adjoint with respect to the
$\mathcal{Y}$-inner product for each $U \in \mathcal{D}_1^{-\delta/2}$. By assumption (H4)
and Theorem 3.6, the set $\mathcal{D}_1^{-\delta/2}$ is simply connected and star-shaped.
Therefore Lemma 3.5 on the integrability of vector fields on Hilbert spaces applies on
the domain $\mathcal{D}_1^{-\delta/2}$ endowed with the $\mathcal{Y}_1$-topology. This proves
the existence of an analytic modified Hamiltonian
$\tilde H \colon \mathcal{D}_1^{-\delta/2} \to \mathbb{R}$ which is invariant under the
modified flow such that

\[ \tilde F = \mathbb{J} \nabla \tilde H . \]

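To prove (5.21), we decrease $h_*$ such that the right hand side of (5.16) is smaller
than $r/4$ for every $U \in \mathcal{D}^{-\delta}_{\tau,L+1}$. Then, using the mean value
theorem, the bound on the modified vector field given by (2.6) with $M = c_F m$, (5.20)
from the proof of Lemma 5.8, and the invertibility of $\mathbb{J}$, we estimate

\[
\begin{aligned}
|\tilde H(\Psi^h(U)) - \tilde H(\tilde\Phi^h(U))|
&\le \sup_{\substack{y \in \mathbb{P}_m \mathcal{Y} \\ \|y - \mathbb{P}_m U\|_{\mathcal{Y}_1} \le r/4}}
     \|D\tilde H(y)\|_{\mathcal{Y}_1} \, \|\Psi^h(U) - \tilde\Phi^h(U)\|_{\mathcal{Y}_1} \\
&\le \sup_{\substack{y \in \mathbb{P}_m \mathcal{Y} \\ \|y - \mathbb{P}_m U\|_{\mathcal{Y}_1} \le r/4}}
     \|\mathbb{J}^{-1} \tilde f_m(y)\|_{\mathcal{Y}_1} \, \|\Psi^h(U) - \tilde\Phi^h(U)\|_{\mathcal{Y}_1} \\
&\le \|\mathbb{J}^{-1}\|_{\mathcal{E}(\mathcal{Y}_1)} \, (1 + 1.65 \, \eta) \, c_F \, m \, h^{\nu} c_\Phi \, e^{-c h^{-1/(1+q)}} .
\end{aligned}
\]

Now we note that $\mathbb{J}$ and $A$ commute, see (3.13), which, as
$\mathbb{J}^{-1} \in \mathcal{E}(\mathcal{Y})$ by (H0), implies
$\mathbb{J}^{-1} \in \mathcal{E}(\mathcal{Y}_1)$. Since $\tilde H$ is conserved under the
modified flow, choosing $m$ as in (5.14), we obtain

\[ |\tilde H(\Psi^h(U)) - \tilde H(U)| \le c \, h^{\nu - \frac{q}{1+q}} e^{-c h^{-1/(1+q)}} . \]

This inequality implies (5.21) with $c_* > 0$ possibly decreased from its value in
Lemma 5.8. □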



What is still missing is a statement regarding the closeness of the modified to
the original Hamiltonian. Ideally, we would like $H$ and $\tilde H$ to be
$O(h^p)$-close. In a first attempt to prove such a result, we write

\[ |H(U) - \tilde H(U)| \le |H(U) - H(\mathbb{P}_m U)| + |H(\mathbb{P}_m U) - \tilde H(U)| , \tag{5.22} \]

where $m = m(h)$ is as in (5.14). Under the assumptions of Lemma 5.10, the first
term on the right is exponentially small for $U \in \mathcal{D}^{-\delta}_{\tau,L+1}$. Indeed,
noticing that $m_*$ and hence also $m(h) \ge m_*$ in the proof of Lemma 5.8 were chosen
such that, in particular, $\mathbb{P}_m U \in \mathcal{D}$ holds for all
$U \in \mathcal{D}^{-\delta}_{\tau,L+1}$, we can estimate

\H(U) - H(F m U)\ < l - \(Q m UJ- x AQ m U)\ + \V(U) - V(F m U)\ 

< l|I _1 ^ll£(y) WQmUfy + max|| W(£/)||y \\® m U\\ y 

< W^MsmiWQmUWy + wr'W^Mo) \\Q m u\\ y 

1/9 



<c H ||Q w ,U'||y<ci r m- l '- 1 e- Tm ||^b Tl+1 . (5.23) 



In the first inequality above, we used that J -1 and P m commute, cf. Lemma [ 
The second inequality is based on the mean value theorem and the third inequality
holds due to the identity $\nabla V(U) = \mathbb{J}^{-1} B(U)$. The constant $c_H$ is finite
due to the boundedness of the operators $\mathbb{J}^{-1} A$ and $\mathbb{J}^{-1}$ provided by
(H0), and the boundedness of the domain $\mathcal{D}_{\tau,L+1}$. With (5.14), the final
line in (5.23) is also exponentially small with respect to $h$.

To estimate the second term of (5.22), choose some fixed
$U^0 \in \operatorname{ck}(\mathcal{D}_1)^{-\delta}$ and set $\tilde H(U^0) = H(U^0)$. The naive
choice is then to employ Lemma 2.2 with $u_m = \mathbb{P}_m U$, $u_m^0 = \mathbb{P}_m U^0$,
$M = c_F m$ where $m$ is given by (5.14), and $r = \delta/4$ as in the proof of
Lemma 5.8. This yields



\H(u m ) - H m (u m )\ < 



(u m - u°„, J 1 (f m - f m ){u Q m + s(u - u£j)) ds 



I) 



< Aip. || j- 1|| r -v m p sup m _ hP 

= 0{h^r) (5.24) 



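for every $U \in \mathcal{D}_1^{-\delta}$. In the final step, we used that $\mathbb{J}$ and $A$
commute so that $\|\mathbb{J}^{-1}\|_{\mathcal{E}(\mathcal{Y}_1)} = \|\mathbb{J}^{-1}\|_{\mathcal{E}(\mathcal{Y})}$.
Gevrey class regularity does not improve this part of the estimate.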

Clearly, (5.24) is much worse than the expected $O(h^p)$. A superficial improvement
might be obtained by "paying" with the exponent in (5.21): for any $\varepsilon > 0$,
setting $\alpha = \varepsilon/(p+1)$ in (5.19) will yield $O(h^{p-\varepsilon})$ in (5.24).
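
The loss in (5.24) is plain exponent arithmetic. Assuming, as the remark on $\alpha = \varepsilon/(p+1)$ suggests, that the naive bound scales like $h^p m^{p+1}$ with $m = \zeta h^{-\alpha}$, the resulting power of $h$ is $p - \alpha(p+1)$. A minimal sketch with exact rational arithmetic follows; the helper name `naive_power` is hypothetical.

```python
from fractions import Fraction


def naive_power(p, alpha):
    """Power of h in h**p * m**(p+1) when m is proportional to h**(-alpha).

    This scaling is an assumption read off from the surrounding discussion,
    not a statement taken verbatim from the source.
    """
    return Fraction(p) - Fraction(alpha) * (p + 1)


if __name__ == "__main__":
    p = 2
    for q in (1, 2):
        # Balanced choice alpha = q/(1+q) from the embedding lemma.
        print("q =", q, "balanced power:", naive_power(p, Fraction(q, 1 + q)))
    # "Paying" with the exponent: alpha = eps/(p+1) gives the power p - eps.
    eps = Fraction(1, 10)
    print("eps =", eps, "power:", naive_power(p, eps / (p + 1)))
```

With the balanced choice $\alpha = q/(1+q)$ the power is $(p-q)/(1+q)$, well below $p$, whereas $\alpha = \varepsilon/(p+1)$ recovers $p - \varepsilon$ at the price of a weaker exponential rate.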

A closer inspection reveals that, in the context of the semilinear evolution
equation, the second inequality of (2.8) in the proof of Lemma 2.2 is too weak: when
estimating the terms $f^j$ of the modified vector field in some fixed Hilbert space
norm, the unboundedness of the operators $A$ contained therein will introduce a
factor $m^j$, see (2.7). This propagates into the proof of (5.24). Note, however, that
these estimates are simply about consistency, not about constructing a flow. Thus,
we can afford to lose smoothness rather than order. In other words, we can estimate
the $f^j$ as maps from one Hilbert space into another with a weaker norm. This will
be detailed in the next section.



5.4. Backward error analysis on Hilbert spaces. In this section, we present
a more subtle estimate on the difference between the original Hamiltonian and
the modified Hamiltonian. The main difference to the derivation of (5.24) in the
previous section is that we consider the expansion coefficients of the numerical flows
and of the modified vector fields as maps between different rungs on our scale of
Hilbert spaces such that the "loss of derivatives" is carefully accounted for.

We begin by establishing the necessary functional setting for the backward error
analysis on Hilbert spaces. We then review a result from [28] on the Galerkin
projection error for the numerical time-$h$ maps. This estimate is then propagated
into an estimate on the difference between the full and the modified vector field,
which finally implies a corresponding estimate on the difference between
the exact Hamiltonian and the modified Hamiltonian.

In this section, we work directly with the standard construction of the modified
vector field. Namely, for $\ell = 1, \dots, K+1$, we write

\[ G^\ell = \frac{1}{\ell!} \, \partial_h^\ell \Psi^h \Big|_{h=0} \tag{5.25} \]

to denote the $\ell$th coefficient of the expansion of $\Psi^h$ in powers of $h$, and
define the corresponding expansion coefficients for the modified vector field on
Hilbert spaces, setting $F^1 = G^1$ and

\[
F^\ell = G^\ell - \sum_{i=2}^{\ell} \frac{1}{i!} \sum_{\ell_1 + \dots + \ell_i = \ell}
D_{\ell_1} \cdots D_{\ell_{i-1}} F^{\ell_i}
\tag{5.26}
\]

for $\ell = 2, \dots, K+1$, where the sum ranges over indices $\ell_i \ge 1$ for all $i$,
and where $D_j G = DG \, F^j$. We also recall from Section 5.3 that $g_m^\ell$ and
$f_m^\ell$ denote the $\ell$th coefficients of the expansions of the projected flow and
the corresponding modified vector field, respectively, and set
$G_m^\ell = g_m^\ell \circ \mathbb{P}_m$ and $F_m^\ell = f_m^\ell \circ \mathbb{P}_m$.
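
To see how a recursion of the type (5.26) operates, it can help to run it on a scalar toy problem rather than on the PDE. The sketch below assumes the standard form $F^\ell = G^\ell - \sum_{i=2}^{\ell} \frac{1}{i!} \sum_{\ell_1+\dots+\ell_i=\ell} D_{\ell_1} \cdots D_{\ell_{i-1}} F^{\ell_i}$ with $D_j g = g' F^j$, applied to the explicit Euler map $y \mapsto y + h y^2$ for $\dot y = y^2$, so that $G^1 = y^2$ and $G^\ell = 0$ for $\ell \ge 2$. This example is not taken from the paper: explicit Euler is neither A-stable nor symplectic, and it is chosen only because every $F^\ell$ is then a monomial $c_\ell y^{\ell+1}$, which reduces the recursion to rational arithmetic on the coefficients. The function names are hypothetical.

```python
from fractions import Fraction
from math import factorial


def compositions(total, parts):
    """Yield all tuples of `parts` positive integers that sum to `total`."""
    if parts == 1:
        if total >= 1:
            yield (total,)
        return
    for first in range(1, total - parts + 2):
        for rest in compositions(total - first, parts - 1):
            yield (first,) + rest


def euler_modified_coefficients(max_order):
    """Return {l: c_l} with F^l(y) = c_l * y**(l+1) for explicit Euler on y' = y**2."""
    c = {1: Fraction(1)}  # F^1 = G^1 = y**2
    for l in range(2, max_order + 1):
        value = Fraction(0)  # G^l = 0 for l >= 2, since the Euler map is y + h*y**2
        for i in range(2, l + 1):
            inner = Fraction(0)
            for ls in compositions(l, i):
                # Start from F^{l_i} = coeff * y**k ...
                coeff, k = c[ls[-1]], ls[-1] + 1
                # ... and apply D_{l_{i-1}}, ..., D_{l_1} in turn, where
                # D_j (coeff * y**k) = (coeff * k * y**(k-1)) * (c_j * y**(j+1)).
                for lj in reversed(ls[:-1]):
                    coeff, k = coeff * k * c[lj], k + lj
                inner += coeff
            value -= inner / factorial(i)
        c[l] = value
    return c


if __name__ == "__main__":
    for l, cl in euler_modified_coefficients(4).items():
        print(f"F^{l}(y) = ({cl}) * y^{l + 1}")
```

In this toy setting the truncated modified vector field is $F^1 + h F^2 + h^2 F^3 + \dots$, and the first coefficients come out as $c_2 = -1$, $c_3 = 3/2$, $c_4 = -8/3$. In the PDE setting of this section the same recursion involves the unbounded operator $A$, which is why each application of $D_j$ has to be tracked as a loss of derivatives on the scale of spaces.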

In the notation of condition (B1), we fix $\delta \in (0, \delta_*]$ and abbreviate
$\mathcal{U}_\kappa = \mathcal{D}_\kappa^{-\delta}$ for $\kappa = 1, \dots, K+1$. Then the
regularity results Theorem 4.1 on $\Psi^h$ and Theorem 5.6 on $\Psi_m^h$ imply that, in
particular, there exists $m_*$ such that for all $m \ge m_*$ and $\ell = 1, \dots, K+1$,

\[
G^\ell, G_m^\ell \in \bigcap_{\substack{j+k \le N \\ \ell \le k \le K+1}} C_b^j(\mathcal{U}_k; \mathcal{Y}_{k-\ell}) .
\tag{5.27}
\]

Moreover, bounds in the norms associated with (5.27) are uniform in $m \ge m_*$.

In the following, we will state such bounds on vector fields in terms of the
three-parameter family of norms

\[
|g|_{N,K,S} = \max_{\substack{j+k \le N \\ S \le k \le K}}
\|D^j g\|_{C_b(\mathcal{U}_k; \mathcal{E}^j(\mathcal{Y}_k; \mathcal{Y}_{k-S}))}
\tag{5.28}
\]

for $1 \le S \le K \le N$. The parameter $S$ plays the role of a "loss of derivatives"
index as it forces the image of the map to be estimated at least $S$ rungs down the
scale. We can then prove a simple result on the regularity of the expansion
coefficients of the modified vector field.

Lemma 5.11 (Modified vector field on a scale of Banach spaces). Assume that
$G^1, \dots, G^{K+1}$ are of class (5.27). Then the vector fields $F^1, \dots, F^{K+1}$
defined by (5.26) are also of class (5.27).

Proof. The proof is based on the simple fact that
$F \in C_b^k(\mathcal{Y}_{i+j}; \mathcal{Y}_j)$ and $G \in C_b^{k+1}(\mathcal{Y}_j; \mathcal{Y})$
implies that $DG \, F \in C_b^k(\mathcal{Y}_{i+j}; \mathcal{Y})$. Thus, it remains to observe
that repeated application to the terms in the inner sum of (5.26) causes all loss
indices to always sum up to $\ell$. We proceed by induction in $\ell$. The case
$\ell = 1$ does not require proof. Assume therefore that $\ell > 1$ and the lemma is
proved up to index $\ell - 1$. For $\ell > \ell_1 \ge 1$, we estimate, with
$G = D_{\ell_2} \cdots D_{\ell_{i-1}} F^{\ell_i}$, that

\[
\begin{aligned}
|D_{\ell_1} G|_{N,K+1,\ell}
&= |DG \, F^{\ell_1}|_{N,K+1,\ell} \\
&= \max_{\substack{j+k \le N \\ \ell \le k \le K+1}}
   \|D^j (DG \, F^{\ell_1})\|_{C_b(\mathcal{U}_k; \mathcal{E}^j(\mathcal{Y}_k; \mathcal{Y}_{k-\ell}))} \\
&\le 2^N \max_{\substack{j+k \le N \\ \ell \le k \le K+1}}
   \|D^{j+1} G\|_{C_b(\mathcal{U}_{k-\ell_1}; \mathcal{E}^{j+1}(\mathcal{Y}_{k-\ell_1}; \mathcal{Y}_{k-\ell}))}
   \max_{\substack{j+k \le N \\ \ell_1 \le k \le K+1}}
   \|D^j F^{\ell_1}\|_{C_b(\mathcal{U}_k; \mathcal{E}^j(\mathcal{Y}_k; \mathcal{Y}_{k-\ell_1}))} \\
&= 2^N \max_{\substack{j+k \le N+1-\ell_1 \\ \ell-\ell_1 \le k \le K+1-\ell_1}}
   \|D^j G\|_{C_b(\mathcal{U}_k; \mathcal{E}^j(\mathcal{Y}_k; \mathcal{Y}_{k-(\ell-\ell_1)}))}
   \, |F^{\ell_1}|_{N,K+1,\ell_1} \\
&\le 2^N \, |G|_{N,K+1-\ell_1,\ell-\ell_1} \, |F^{\ell_1}|_{N,K+1,\ell_1} ,
\end{aligned}
\tag{5.29}
\]

provided that $G$ is of class (5.27) with $K$ replaced by $K - \ell_1$ and $\ell$ replaced
by $\ell - \ell_1$. Here the first inequality is based on the product rule and selective
weakening of the norm on the domain spaces, thereby increasing the respective operator
norms. The identity between the third and the fourth line is achieved by redefining
$j + 1$ as $j$ and $k - \ell_1$ as $k$. The final inequality holds because
$\ell_1 \ge 1$, so that we are strictly extending the range of the running indices. We
note that the second term in the final line of (5.29) is bounded by the induction
hypothesis. The estimation of the first term in the final line of (5.29) can now be
done recursively to resolve the entire product
$D_{\ell_1} \cdots D_{\ell_{i-1}} F^{\ell_i}$ from the inner sum of (5.26) in terms of
quantities which are bounded by the induction hypothesis. This is always possible,
because at the $k$th step of this process we lose $\ell_k$ derivatives, and the sum of
the loss indices satisfies $\ell_1 + \dots + \ell_i = \ell$ by construction. □

We now aim to derive estimates on the difference between $F^\ell$ and $F_m^\ell$ with
respect to the same type of norm. In [28], we already obtained a related result on
the difference between $\Psi^h$ and $\Psi_m^h$ which we can start from. Setting
$\mathcal{I} = (0, h_*)$ and $\mathcal{U} = \mathcal{D}^{-\delta}_{K+1}$, we define the norm

\[
\|\Psi\|_{N,K} = \max_{\substack{j+k \le N \\ \ell \le k \le K}}
\|D_U^j \partial_h^\ell \Psi\|_{C_b(\mathcal{U} \times \mathcal{I}; \mathcal{E}^j(\mathcal{Y}_{K+1}; \mathcal{Y}_{k-\ell}))}
\]

for $0 \le K \le N$. Then the stability of the numerical method on a scale of
Hilbert spaces under spectral truncation can be formulated as follows.

Lemma 5.12 ([28, Theorem 3.7]). Assume (A), (B1), and (RK1-3). Choose
$\delta \in (0, \delta_*]$ small enough such that $\mathcal{D}^{-\delta}_{K+1}$ is nonempty.
Then there is $h_* > 0$ such that for every $0 \le S \le K$,

\[ \|\Psi^h - \Psi_m^h\|_{N-1-S,K-S} = O(m^{-S}) \tag{5.30} \]

as $m \to \infty$. The order constants depend only on the bounds afforded by (B1),
(3.10), on the coefficients of the method, and on $\delta$.

Remark 5.13. If $N > K + 1$, the statement as proved in [28] actually holds true
with $K + 1$ in place of $K$. It is easier, however, to proceed with the slightly weaker
version stated here.

We note that Lemma 5.7 above already provided us with an exponential estimate
on $\Psi^h - \Psi_m^h$ for Gevrey regular data, whereas Lemma 5.12 here also asserts
bounds on derivatives with respect to $h$; the proof is correspondingly more
complicated even in spaces of finite order and can be found in [28].

To proceed, we observe that Lemma 5.12 holds with $K$ in the statement of the
lemma replaced by any $\kappa$ between $S$ and $K$ with $K$ as defined in condition (B1).
Then, specializing to the particular value $k = \kappa - S$ in the definition of the norm
appearing in (5.30), we obtain

S+i<K 

Thus, taking the maximum over the allowed range $\kappa \in S, \dots, K$,

^-i^ 9 '^' 1 OWc~(u K xz l£H y K ;y M)) = 0(m~ S ) . (5.31) 

S+^<k<A" 

Due to the definition of $G^\ell$ in (5.25), this directly implies that

\[ |G^\ell - G_m^\ell|_{N-1,K,S+\ell} = O(m^{-S}) \tag{5.32} \]

for every $\ell \in 1, \dots, K - S$.

Lemma 5.14 (Stability of the modified vector field). Suppose $G^1, \dots, G^K$ and
$\hat G^1, \dots, \hat G^K$ are of class (5.27), and let $F^\ell$ and $\hat F^\ell$ denote
expansion coefficients of the respective associated modified vector fields defined via
(5.26). Then, for $S \in 0, \dots, K$ and every $\ell \in 1, \dots, K - S$,

\[ |F^\ell - \hat F^\ell|_{N-1,K,S+\ell} \le c \max_{k = 1, \dots, \ell} |G^k - \hat G^k|_{N-1,K,S+k} , \]

where the constant $c$ depends on the bounds on $G^k$ and $\hat G^k$ in the norm
$|\cdot|_{N-1,K,k}$ for $k = 1, \dots, \ell$.

Proof. The proof follows the same route as the proof of Lemma 5.11. We set
$\hat D_\ell G = DG \, \hat F^\ell$. The crucial estimate corresponding to (5.29) then takes
the form

\[
\begin{aligned}
|D_{\ell_1} G - \hat D_{\ell_1} \hat G|_{N-1,K,S+\ell}
&\le |DG \, (F^{\ell_1} - \hat F^{\ell_1})|_{N-1,K,S+\ell}
   + |D(G - \hat G) \, \hat F^{\ell_1}|_{N-1,K,S+\ell} \\
&\le 2^N \, |G|_{N-1,K-S-\ell_1,\ell-\ell_1} \, |F^{\ell_1} - \hat F^{\ell_1}|_{N-1,K,S+\ell_1} \\
&\quad + 2^N \, |G - \hat G|_{N-1,K-\ell_1,S+\ell-\ell_1} \, |\hat F^{\ell_1}|_{N-1,K,\ell_1} ,
\end{aligned}
\]

where the estimates in the last inequality are derived just as in (5.29). This
reasoning can again be applied iteratively to resolve the entire difference of products
of the form $D_{\ell_1} \cdots D_{\ell_{i-1}} F^{\ell_i}$, where we note that the loss
indices now add up to exactly $S + \ell$ as required. □

We now turn our attention to the optimally truncated modified vector field
$\tilde F = \tilde f_m \circ \mathbb{P}_m$ where $m = m(h)$ is as in (5.14). This is the same
modified vector field which gives rise to the modified Hamiltonian in Lemma 5.10. Then
the difference between $\tilde F$ and $F$, the exact vector field of the semilinear
evolution equation (1.1), can be estimated as follows.

Lemma 5.15. Suppose conditions (A), (B0), (B1), and (RK1-3) are satisfied with
$K \ge a + b$, where $a$ and $b$ are given by (5.1). Then for every
$\delta \in (0, \delta_*]$, the difference between the original vector field $F$ and the
modified vector field $\tilde F$ from Lemma 5.8 satisfies

\[ \|F - \tilde F\|_{C_b(\mathcal{D}^{-\delta}_{a+b}; \mathcal{Y})} = O(h^p) . \tag{5.33} \]

Proof. We define two intermediate vector fields. Let $\tilde F^a$ denote the modified
vector field of the numerical method applied to the original semilinear evolution
equation (1.1) computed up to order $a$. Formally, as in the finite dimensional case,
it has an expansion of the form (2.4),

\[ \tilde F^a = F + \sum_{j=p}^{a-1} h^j F^{j+1} . \tag{5.34} \]

Note that $F = F^1$. Due to (5.27) and Lemma 5.11, all coefficients $F^k$ in the
expansion (5.34) and consequently $\tilde F^a$ are also of class (5.27) with
$\ell = a \le K$.

We now choose $h_*$ and $m_*$ as in the proof of Lemma 5.8 and suppose $m \ge m_*$,
so that the projected modified vector fields are well defined. Let
$\tilde F_m^a = \tilde f_m^a \circ \mathbb{P}_m$ denote the corresponding modified vector
field of the projected system (5.5), again up to order $a$, with
$m = O(h^{-q/(q+1)})$ as in (5.14). By Lemma 5.11 applied to $G_m^\ell$, the vector field
$\tilde F_m^a$ is also of class (5.27).

We now decompose

\[ F - \tilde F = (F - \tilde F^a) + (\tilde F^a - \tilde F_m^a) + (\tilde F_m^a - \tilde F) . \tag{5.35} \]

We show that when $a$ is chosen as in (5.1), each term on the right is $O(h^p)$ in
appropriate norms.

A bound on the first difference on the right of (5.35) follows directly from the
definition of $\tilde F^a$ in (5.34). Using the norms defined in (5.28),

\[ |F - \tilde F^a|_{N-1,K,a} \le h^p \sum_{j=p}^{a-1} h^{j-p} \, |F^{j+1}|_{N-1,K,a} . \]

By Lemma 5.11, all norms in the right hand sum are finite, so that, for
$a \in 1, \dots, K$,

\[ |F - \tilde F^a|_{N-1,K,a} = O(h^p) . \tag{5.36} \]

To estimate the second difference on the right of (5.35), we apply Lemma 5.14
with $\hat G^\ell = G_m^\ell$, which we recall are also of class (5.27), so that (5.32)
applies, yielding

\[ |\tilde F^a - \tilde F_m^a|_{N-1,K,a+S} = O(m^{-S}) \tag{5.37} \]

for $S \in 0, \dots, K - a$.

To estimate the third difference on the right of (5.35), as in the proof of Lemma 5.8
we choose $r = \delta/4$ and $m_*$ large enough such that for each
$U \in \mathcal{D}_1^{-\delta/2}$, the ball $B_r^{\mathcal{Y}_1}(\mathbb{P}_m U)$ is fully
contained in $\mathcal{D}$ so that $f_m$ is well-defined on
$\mathcal{D} \cap \mathbb{P}_m \mathcal{D}_1$. Then estimate (5.12) holds true and so the bound
(2.7) holds for each of the $f_m^j$ with $M = c_F m$ on this ball. We now regard the
modified vector field $\tilde f_{m(h)}$ with $m(h)$ given by (5.14) as the modified vector
field for the differential equation $\dot u_m = \tilde f^a_{m(h)}(u_m)$ when
$\psi^h_{m(h)}$ is used as its $O(h^a)$ time-discretization. In this situation, Lemma 2.2
with $a$ in place of $p$ applies. Moreover, this construction holds true with uniform
constants for any $U \in \mathcal{D}_1^{-\delta/2}$, so that, recalling
$\tilde F_m^a = \tilde f_m^a \circ \mathbb{P}_m$ and $\tilde F = \tilde f \circ \mathbb{P}_m$,
the asserted bound reads

\[ \|\tilde F^a_{m(h)} - \tilde F\|_{C_b(\mathcal{D}_1^{-\delta/2}; \mathcal{Y}_1)} = O(h^a \, m(h)^{a+1}) . \tag{5.38} \]

We now seek conditions under which the estimates (5.36), (5.37), and (5.38)
are all of order $h^p$. Due to (5.14), $m(h) = O(h^{-q/(1+q)})$, so that the requirement
$(m(h))^{-S} = O(h^p)$ leads to the choice


\[ S = \Bigl\lceil p \, \frac{1+q}{q} \Bigr\rceil = b , \]


where $b$ is as in (5.1). Similarly, the requirement $h^a m^{a+1} = O(h^p)$ is
equivalent to

\[ O(h^a m^{a+1}) = O\bigl(h^{a - (a+1) \frac{q}{q+1}}\bigr) = O\bigl(h^{\frac{a-q}{q+1}}\bigr) = O(h^p) \]

and leads to the choice of $a$ stated in (5.1), namely

\[ a = \lceil p(q+1) + q \rceil . \]

Altogether, using (5.35), recalling that $K \ge a + b$, and applying (5.37) with
$S = b$, we obtain (5.33). □
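
The two index choices at the end of the proof are again exponent bookkeeping. The sketch below assumes $m(h) = O(h^{-q/(1+q)})$, takes $a = \lceil p(q+1) + q \rceil$ as stated, and takes $b$ to be the smallest integer $S$ with $m^{-S} = O(h^p)$; the latter is an assumption about the definition in (5.1), which is not reproduced in this section. The helper names are hypothetical, and $p$, $q$ are restricted to positive integers.

```python
from fractions import Fraction
from math import ceil


def loss_indices(p, q):
    """Return (a, b) for method order p and exponent q (positive integers).

    a = ceil(p*(q+1) + q); b is assumed to be the smallest integer S
    with S*q/(1+q) >= p, i.e. ceil(p*(1+q)/q).
    """
    a = ceil(Fraction(p * (q + 1) + q))
    b = ceil(Fraction(p * (q + 1), q))
    return a, b


def resulting_powers(p, q):
    """Powers of h obtained from m**(-b) and from h**a * m**(a+1)."""
    a, b = loss_indices(p, q)
    projection_power = Fraction(b * q, 1 + q)  # m**(-b) = O(h**projection_power)
    embedding_power = Fraction(a - q, 1 + q)   # h**a * m**(a+1) = O(h**embedding_power)
    return projection_power, embedding_power


if __name__ == "__main__":
    # q = 1: semilinear wave equation, q = 2: nonlinear Schrodinger equation.
    for q in (1, 2):
        for p in (2, 4):
            a, b = loss_indices(p, q)
            print(f"q={q} p={p}: a={a}, b={b}, powers={resulting_powers(p, q)}")
```

For an order $p = 2$ method this gives $a = 5$, $b = 4$ when $q = 1$ and $a = 8$, $b = 3$ when $q = 2$, so that the regularity requirement $K \ge a + b$ reads $K \ge 9$ and $K \ge 11$, respectively, under the stated assumption on $b$.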

Remark 5.16. We can change $\tilde F$ so that its leading order linear part is $A$ rather
than $A \mathbb{P}_{m(h)}$ since, by (3.5), for $U^0 \in \mathcal{Y}_{\tau,1}$ the difference
between the two modified flows is exponentially small in the $\mathcal{Y}$-norm.

In the Hamiltonian case, the previous result on $O(h^p)$-closeness of true and
modified vector field carries over to a statement on $O(h^p)$-closeness of the
corresponding Hamiltonians.

Lemma 5.17. Under the assumptions of Lemma 5.15, suppose that, in addition,
the semilinear evolution equation is Hamiltonian satisfying (H0-4) and that the
numerical method is symplectic (RK4) and of order $p$. Then the modified Hamiltonian
$\tilde H$ from Lemma 5.10 can be chosen such that

\[ \|H - \tilde H\|_{C_b(\mathcal{D}^{-\delta}_{a+b}; \mathbb{R})} = O(h^p) . \]

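Proof. By (H4) and Theorem 3.6, $\mathcal{D}^{-\delta}_{a+b}$ is star-shaped. Let
$U^0 \in \operatorname{ck}(\mathcal{D}^{-\delta}_{a+b})$. Since the Hamiltonian from
Lemma 5.10 is defined only up to a constant, we can choose the constant of integration
such that $\tilde H(U^0) = H(U^0)$. Then, following the proof of Lemma 2.2, we estimate,
for any $U \in \mathcal{D}^{-\delta}_{a+b}$,

\[
\begin{aligned}
|H(U) - \tilde H(U)|
&\le \Bigl| \int_0^1 \bigl\langle \mathbb{J}^{-1} (F - \tilde F)(t \, U + (1-t) \, U^0), \, U - U^0 \bigr\rangle \, dt \Bigr| \\
&\le 2 \, \|\mathbb{J}^{-1}\|_{\mathcal{E}(\mathcal{Y})} \, \|F - \tilde F\|_{C_b(\mathcal{D}^{-\delta}_{a+b}; \mathcal{Y})}
   \sup_{U \in \mathcal{D}^{-\delta}_{a+b}} \|U\|_{\mathcal{Y}} .
\end{aligned}
\]

Since $\mathcal{D}_{a+b}$ is a $\mathcal{Y}$-bounded set, Lemma 5.15 implies that the right
hand side is $O(h^p)$. □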